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Preface 


This book is primarily intended to accompany an undergraduate course 
in atomic physics. It covers the core material and a selection of more 
advanced topics that illustrate current research in this field. The first 
six chapters describe the basic principles of atomic structure, starting 
in Chapter 1 with a review of the classical ideas. Inevitably the dis- 
cussion of the structure of hydrogen and helium in these early chapters 
has considerable overlap with introductory quantum mechanics courses, 
but an understanding of these simple systems provides the basis for the 
treatment of more complex atoms in later chapters. Chapter 7 on the 
interaction of radiation with atoms marks the transition between the 
earlier chapters on structure and the second half of the book which cov- 
ers laser spectroscopy, laser cooling, Bose-Einstein condensation of di- 
lute atomic vapours, matter-wave interferometry and ion trapping. The 
exciting new developments in laser cooling and trapping of atoms and 
Bose-Einstein condensation led to Nobel prizes in 1997 and 2001, respec- 
tively. Some of the other selected topics show the incredible precision 
that has been achieved by measurements in atomic physics experiments. 
This theme is taken up in the final chapter that looks at quantum infor- 
mation processing from an atomic physics perspective; the techniques 
developed for precision measurements on atoms and ions give exquisite 
control over these quantum systems and enable elegant new ideas from 
quantum computation to be implemented. 

The book assumes a knowledge of quantum mechanics equivalent to an 
introductory university course, e.g. the solution of the Schrodinger equa- 
tion in three dimensions and perturbation theory. This initial knowledge 
will be reinforced by many examples in this book; topics generally re- 
garded as difficult at the undergraduate level are explained in some de- 
tail, e.g. degenerate perturbation theory. The hierarchical structure of 
atoms is well described by perturbation theory since the different layers 
of structure within atoms have considerably different energies associated 
with them, and this is reflected in the names of the gross, fine and hyper- 
fine structures. In the early chapters of this book, atomic physics may 
appear to be simply applied quantum mechanics, i.e. we write down the 
Hamiltonian for a given interaction and solve the Schrodinger equation 
with suitable approximations. I hope that the study of the more ad- 
vanced material in the later chapters will lead to a more mature and 
deeper understanding of atomic physics. Throughout this book the ex- 
perimental basis of atomic physics is emphasised and it is hoped that 
the reader will gain some factual knowledge of atomic spectra. 


vl 


Preface 


The selection of topics from the diversity of current atomic physics 
is necessarily subjective. I have concentrated on low-energy and high- 
precision experiments which, to some extent, reflects local research in- 
terests that are used as examples in undergraduate lectures at Oxford. 
One of the selection criteria was that the material is not readily avail- 
able in other textbooks, at the time of writing, e.g. atomic collisions 
have not been treated in detail (only a brief summary of the scattering 
of ultracold atoms is included in Chapter 10). Other notable omissions 
include: X-ray spectra, which are discussed only briefly in connection 
with the historically important work of Moseley, although they form an 
important frontier of current research; atoms in strong laser fields and 
plasmas; Rydberg atoms and atoms in doubly- and multiply-excited 
states (e.g. excited by new synchrotron and free-electron laser sources); 
and the structure and spectra of molecules. 

I would like to thank Geoffrey Brooker for invaluable advice on physics 
(in particular Appendix B) and on technical details of writing a textbook 
for the Oxford Master Series. Keith Burnett, Jonathan Jones and An- 
drew Steane have helped to clarify certain points, in my mind at least, 
and hopefully also in the text. The series of lectures on laser cooling 
given by William Phillips while he was a visiting professor in Oxford was 
extremely helpful in the writing of the chapter on that topic. The fol- 
lowing people provided very useful comments on the draft manuscript: 
Rachel Godun, David Lucas, Mark Lee, Matthew McDonnell, Martin 
Shotter, Claes-Gé6ran Wahlstr6m (Lund University) and the (anony- 
mous) reviewers. Without the encouragement of S6nke Adlung at OUP 
this project would not have been completed. Irmgard Smith drew some 
of the diagrams. I am very grateful for the diagrams and data supplied 
by colleagues, and reproduced with their permission, as acknowledged 
in the figure captions. Several of the exercises on atomic structure de- 
rive from Oxford University examination papers and it is not possible to 
identify the examiners individually—some of these exam questions may 
themselves have been adapted from some older sources of which I am 
not aware. 

Finally, I would like to thank Professors Derek Stacey, Joshua Silver 
and Patrick Sandars who taught me atomic physics as an undergraduate 
and graduate student in Oxford. I also owe a considerable debt to the 
book on elementary atomic structure by Gordon Kemble Woodgate, who 
was my predecessor as physics tutor at St Peter’s College, Oxford. In 
writing this new text, I have tried to achieve the same high standards 
of clarity and conciseness of expression whilst introducing new examples 
and techniques from the laser era. 


Background reading 


It is not surprising that our language should be incapable of 
describing the processes occurring with the atoms, for it was 
invented to describe the experiences of daily life, and these 
consist only of processes involving exceeding large numbers 


of atoms. Furthermore, it is very difficult to modify our 
language so that it will be able to describe these atomic pro- 
cesses, for words can only describe things of which we can 
form mental pictures, and this ability, too, in the result of 
daily experience. Fortunately, mathematics is not subject to 
this limitation, and it has been possible to invent a mathe- 
matical scheme—the quantum theory—which seems entirely 
adequate for the treatment of atomic processes. 

From The physical principles of the quantum theory, Werner 
Heisenberg (1930). 


The point of the excerpt is that quantum mechanics is essential for a 
proper description of atomic physics and there are many quantum me- 
chanics textbooks that would serve as useful background reading for this 
book. The following short list includes those that the author found par- 
ticularly relevant: Mandl (1992), Rae (1992) and Griffiths (1995). The 
book Atomic spectra by Softley (1994) provides a concise introduction to 
this field. The books Cohen-Tannoudji et al. (1977), Atkins (1983) and 
Basdevant and Dalibard (2000) are very useful for reference and contain 
many detailed examples of atomic physics. Angular-momentum theory 
is very important for dealing with complicated atomic structures, but 
it is beyond the intended level of this book. The classic book by Dirac 
(1981) still provides a very readable account of the addition of angular 
momenta in quantum mechanics. A more advanced treatment of atomic 
structure can be found in Condon and Odabasi (1980), Cowan (1981) 
and Sobelman (1996). 


Oxford Ga Jake 


Web site: 
http://www.physics.ox.ac.uk/users/foot 


This site has supplementary information and corrections found after 
going to press. 
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Early atomic physics 


1.1 Introduction 


The origins of atomic physics were entwined with the development of 
quantum mechanics itself ever since the first model of the hydrogen 
atom by Bohr. This introductory chapter surveys some of the early 
ideas, including Einstein’s treatment of the interaction of atoms with 
radiation, and a classical treatment of the Zeeman effect. These meth- 
ods, developed before the advent of the Schrédinger equation, remain 
useful as an intuitive way of thinking about atomic structure and tran- 
sitions between the energy levels. The ‘proper’ description in terms of 
atomic wavefunctions is presented in subsequent chapters. 

Before describing the theory of an atom with one electron, some ex- 
perimental facts are presented. This ordering of experiment followed 
by explanation reflects the author’s opinion that atomic physics should 
not be presented as applied quantum mechanics, but it should be mo- 
tivated by the desire to understand experiments. This represents what 
really happens in research where most advances come about through the 
interplay of theory and experiment. 


1.2 Spectrum of atomic hydrogen 


It has long been known that the spectrum of light emitted by an element 
is characteristic of that element, e.g. sodium in a street lamp, or burn- 
ing in a flame, produces a distinctive yellow light. This crude form of 
spectroscopy, in which the colour is seen by eye, formed the basis for a 
simple chemical analysis. A more sophisticated approach using a prism, 
or diffraction grating, to disperse the light inside a spectrograph shows 
that the characteristic spectrum for atoms is composed of discrete lines 
that are the ‘fingerprint’ of the element. As early as the 1880s, Fraun- 
hofer used a spectrograph to measure the wavelength of lines, that had 
not been seen before, in light from the sun and he deduced the exis- 
tence of a new element called helium. In contrast to atoms, the spectra 
of molecules (even the simplest diatomic ones) contain many closely- 
spaced lines that form characteristic molecular bands; large molecules, 
and solids, usually have nearly continuous spectra with few sharp fea- 
tures. In 1888, the Swedish professor J. Rydberg found that the spectral 
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‘The Swiss mathematician Johann 
Balmer wrote down an _ expression 
which was a particular case of eqn 1.1 
with n = 2, a few years before Jo- 
hannes (commonly called Janne) Ry- 
dberg found the general formula that 
predicted other series. 


a spectrum of the Balmer series of 
lines is on the cover of this book. 


3In this book transitions are also spec- 
ified in terms of their frequency (de- 
noted by f so that f = ci), or in elec- 
tron volts (eV) where appropriate. 


4 Air absorbs radiation at wavelengths 
shorter than about 200nm and so 
spectrographs must be evacuated, as 
well as being made with special optics. 


lines in hydrogen obey the following mathematical formula: 


(1.1) 


where n and n’ are whole numbers; R is a constant that has become 
known as the Rydberg constant. The series of spectral lines for which 
n = 2 and n’ = 3,4,... is now called the Balmer series and lies in the 
visible region of the spectrum. The first line at 656nm is called the 
Balmer-a (or Hg) line and it gives rise to the distinctive red colour of 
a hydrogen discharge—a healthy red glow indicates that most of the 
molecules of Hy have been dissociated into atoms by being bombarded 
by electrons in the discharge. The next line in the series is the Balmer-f 
line at 486nm in the blue and subsequent lines at shorter wavelengths 
tend to a limit in the violet region.? To describe such series of lines it is 
convenient to define the reciprocal of the transition wavelength as the 
wavenumber / that has units of m~! (or often cm"), 


ae 

A 
Wavenumbers may seem rather old-fashioned but they are very useful 
in atomic physics since they are easily evaluated from measured wave- 
lengths without any conversion factor. In practice, the units used for 
a given quantity are related to the method used to measure it, e.g. 
spectroscopes and spectrographs are calibrated in terms of wavelength.® 
A photon with wavenumber v has energy E = hcv. The Balmer for- 
mula implicitly contains a more general empirical law called the Ritz 
combination principle that states: the wavenumbers of certain lines in 
the spectrum can be expressed as sums (or differences) of other lines: 
b3 = 1 t V2, e.g. the wavenumber of the Balmer-@ line (n = 2 ton! = 4) 
is the sum of that for Balmer-a (n = 2 to n’ = 3) and the first line in 
the Paschen series (n = 3 to n’ = 4). Nowadays this seems obvious 
since we know about the underlying energy-level structure of atoms but 
it is still a useful principle for analyzing spectra. Examination of the 
sums and differences of the wavenumbers of transitions gives clues that 
enable the underlying structure to be deduced, rather like a crossword 
puzzle—some examples of this are given in later chapters. The observed 
spectral lines in hydrogen can all be expressed as differences between 
energy levels, as shown in Fig. 1.1, where the energies are proportional 
to 1/n*. Other series predicted by eqn 1.1 were more difficult to observe 
experimentally than the Balmer series. The transitions to n = 1 give 
the Lyman series in the vacuum ultraviolet region of the spectrum.* The 
series of lines with wavelengths longer than the Balmer series lie in the 
infra-red region (not visible to the human eye, nor readily detected by 
photographic film—the main methods available to the early spectroscop- 
ists). The following section looks at how these spectra can be explained 
theoretically. 


(1.2) 
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1.3. Bohr’s theory 


In 1913, Bohr put forward a radical new model of the hydrogen atom 
using quantum mechanics. It was known from Rutherford’s experiments 
that inside atoms there is a very small, dense nucleus with a positive 
charge. In the case of hydrogen this is a single proton with a single elec- 
tron bound to it by the Coulomb force. Since the force is proportional 
to 1/r?, as for gravity, the atom can be considered in classical terms as 
resembling a miniature solar system with the electron orbiting around 
the proton, just like a planet going around the sun. However, quantum 
mechanics is important in small systems and only certain electron orbits 
are allowed. This can be deduced from the observation that hydrogen 
atoms emit light only at particular wavelengths corresponding to tran- 
sitions between discrete energies. Bohr was able to explain the observed 
spectrum by introducing the then novel idea of quantisation that goes 
beyond any previous classical theory. He took the orbits that occur in 
classical mechanics and imposed quantisation rules onto them. 

Bohr assumed that each electron orbits the nucleus in a circle, whose 
radius r is determined by the balance between centripetal acceleration 
and the Coulomb attraction towards the proton. For electrons of mass 
me and speed v this gives 


MeV e 


=e La 
r Aregr2 i 


In SI units the strength of the electrostatic interaction between two 
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Fig. 1.1 The energy levels of the hydro- 
gen atom. The transitions from higher 
shells n’ = 2,3,4,... down to then = 1 
shell give the Lyman series of spectral 
lines. The series of lines formed by 
transitions to other shells are: Balmer 
(n= 2), Raschens (7 — 3) se erack= 
ett (n = 4) and Pfund (n = 5) (the 
last two are not labelled in the figure). 
Within each series the lines are denoted 
by Greek letters, e.g. La for n = 2 to 
f= il awl lly torn = 4 ham = % 
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5 Older systems of units give more suc: 
cinct equations without 47€9; some of 
this neatness can be retained by keep- 
ing e?/4r€9 grouped together. 


charges of magnitude e is characterised by the combination of constants 
e?/4me€9.° This leads to the following relation between the angular fre- 
quency w = v/r and the radius: 


> _ e7/Ame9 (1.4) 
Mer> — 
This is equivalent to Kepler’s laws for planetary orbits relating the square 
of the period 27/w to the cube of the radius (as expected since all steps 
have been purely classical mechanics). The total energy of an electron 
in such an orbit is the sum of its kinetic and potential energies: 


He omy) ee 


(1.5) 
: 


Using eqn 1.3 we find that the kinetic energy has a magnitude equal 
to half the potential energy (an example of the virial theorem). Taking 
into account the opposite signs of kinetic and potential energy, we find 


(1.6) 


This total energy is negative because the electron is bound to the proton 
and energy must be supplied to remove it. To go further Bohr made the 
following assumption. 


Assumption I There are certain allowed orbits for which the electron 
has a fixed energy. The electron loses energy only when it jumps between 
the allowed orbits and the atom emits this energy as light of a given 
wavelength. 


That electrons in the allowed orbits do not radiate energy is contrary 
to classical electrodynamics—a charged particle in circular motion un- 
dergoes acceleration and hence radiates electromagnetic waves. Bohr’s 
model does not explain why the electron does not radiate but simply 
takes this as an assumption that turns out to agree with the experi- 
mental data. We now need to determine which out of all the possible 
classical orbits are the allowed ones. There are various ways of doing this 
and we follow the standard method, used in many elementary texts, that 
assumes quantisation of the angular momentum in integral multiples of 
h (Planck’s constant over 27): 


meur = nh, - (iF 


where n is an integer. Combining this with eqn 1.3 gives the radii of the 
allowed orbits as 

r=agn’, (1.8) 
where the Bohr radius ao is given by 

h? 

(e7/47r€9) Me 
This is the natural unit of length in atomic physics. Equations 1.6 and 
1.8 combine to give the famous Bohr formula: 


ao = (1.9) 


- (1.10) 


The positive integer n is called the principal quantum number.® 
Bohr’s formula predicts that in the transitions between these energy 
levels the atoms emit light with a wavenumber given by 


1 1 
This equation fits very closely to the observed spectrum of atomic hy- 


drogen described by eqn 1.1. The Rydberg constant Ro, in eqn 1.11 is 
defined by 


(1.11) 


(e?/4r€0) Me 
2h 


The factor of he multiplying the Rydberg constant is the conversion fac- 
tor between energy and wavenumbers since the value of R. is given 
in units of m~! (or cm~! in commonly-used units). The measure- 
ment of the spectrum of atomic hydrogen using laser techniques has 
given an extremely accurate value for the Rydberg constant’ R, = 
10 973 731.568 525 m~!. However, there is a subtle difference between 
the Rydberg constant calculated for an electron orbiting a fixed nucleus 
R. and the constant for real hydrogen atoms in eqn 1.1 (we originally 
wrote R without a subscript but more strictly we should specify that 
it is the constant for hydrogen Ry). The theoretical treatment above 
has assumed an infinitely massive nucleus, hence the subscript oo. In 
reality both the electron and proton move around the centre of mass of 
the system. For a nucleus of finite mass M the equations are modified 
by replacing the electron mass me by its reduced mass 


heRoo = (1.12) 


meM 
—— iets 
au Me + M ( ) 
For hydrogen 
My Me 
= Roo —— ~ Ro (1- ; 1.14 
cs ar M, ( ie) ud 


where the electron-to-proton mass ratio is me/Mp ~ 1/1836. This 
reduced-mass correction is not the same for different isotopes of an el- 
ement, e.g. hydrogen and deuterium. This leads to a small but readily 
observable difference in the frequency of the light emitted by the atoms 
of different isotopes; this is called the isotope shift (see Exercises 1.1 and 
le) 


1.4 Relativistic effects 


Bohr’s theory was a great breakthrough. It was such a radical change 
that the fundamental idea about the quantisation of the orbits was at 
first difficult for people to appreciate—they worried about how the elec- 
trons could know which orbits they were going into before they jumped. 
It was soon realised, however, that the assumption of circular orbits is 
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©The alert reader may wonder why 
this is true since we introduced n in 
connection with angular momentum in 
eqn 1.7, and (as shown later) elec- 
trons can have zero angular momen- 
tum. This arises from the simplifica- 
tion of Bohr’s theory. Exercise 1.12 dis- 
cusses a more satisfactory, but longer 
and subtler, derivation that is closer to 
Bohr’s original papers. However, the 
important thing to remember from this 
introduction is not the formalism but 
the magnitude of the atomic energies 
and sizes. 


’This is the 2002 CODATA recom- 
mended value. The currently accepted 
values of physical constants can be 
found on the web site of the National 
Institute of Science and Technology 
(NIST). 
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8This has a simple interpretation in 
terms of the de Broglie wavelength 
associated with an electron Agujp = 
h/mev. The allowed orbits are those 
that have an integer multiple of de 
Broglie wavelengths around the circum- 
ference: 2n7r = ndAgp, i.e. they are 
standing matter waves. Curiously, this 
idea has some resonance with modern 
ideas in string theory. 


®We neglect a factor of 3 in the bino- 
mial expansion of the expression for 7 


at low speeds, v?/c? < 1. 


10 An electron in the Bohr orbit with 
nm = 1 has speed ac. Hence it has linear 
momentum meac and angular momen- 
tum Meacag = h. 


too much of an over-simplification. Sommerfeld produced a quantum 
mechanical theory of electrons in elliptical orbits that was consistent 
with special relativity. He introduced quantisation through a general 
rule that stated ‘the integral of the momentum associated with a coor- 
dinate around one period of the motion associated with that coordinate 
is an integral multiple of Planck’s constant’. This general method can 
be applied to any physical system where the classical motion is periodic. 
Applying this quantisation rule to momentum around a circular orbit 
gives the equivalent of eqn 1.7:° 


Niel < 20r = Te (1.15) 


In addition to quantising the motion in the coordinate 6, Sommerfeld 
also considered quantisation of the radial degree of freedom r. He found 
that some of the elliptical orbits expected for a potential proportional 
to 1/r are also stationary states (some of the allowed orbits have a high 
eccentricity, more like those of comets than planets). Much effort was 
put into complicated schemes based on classical orbits with quantisation, 
and by incorporating special relativity this ‘old quantum theory’ could 
explain accurately the fine structure of spectral lines. The exact details 
of this work are now mainly of historical interest but it is worthwhile 
to make a simple estimate of relativistic effects. In special relativity a 
particle of rest mass m moving at speed v has an energy 
E(v) =yme’, (1.16) 
where the gamma factor is y = 1/,/1 — v?/c?. The kinetic energy of the 
moving particle is AE = E (v) — E(0) = (y — 1) mec?. Thus relativistic 
effects produce a fractional change in energy:? 
AN ae ie 
This leads to energy differences between the various elliptical orbits of 
the same gross energy because the speed varies in different ways around 
the elliptical orbits, e.g. for a circular orbit and a highly elliptical orbit 
of the same gross energy. From eqns 1.3 and 1.7 we find that the ratio 
of the speed in the orbit to the speed of light is 


v @ 
Ae (1.18) 
where the fine-structure constant a is given by 
e*/Ameg 


This fundamental constant plays an important role throughout atomic 
physics.'° Numerically its value is approximately a ~ 1/137 (see inside 
the back cover for a list of constants used in atomic physics). From 
eqn 1.17 we see that relativistic effects lead to energy differences of 
order a” times the gross energy. (This crude estimate neglects some 
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dependence on principal quantum number and Chapter 2 gives a more 
quantitative treatment of this fine structure.) It is not necessary to go 
into all the refinements of Sommerfeld’s relativistic theory that gave 
the energy levels in hydrogen very precisely, by imposing quantisation 
rules on classical orbits, since ultimately a paradigm shift was neces- 
sary. Those ideas were superseded by the use of wavefunctions in the 
Schrodinger equation. The idea of elliptical orbits provides a connection 
with our intuition based on classical mechanics and we often retain some 
traces of this simple picture of electron orbits in our minds. However, 
for atoms with more than one electron, e.g. helium, classical models do 
not work and we must think in terms of wavefunctions. 


1.5 Moseley and the atomic number 


At the same time as Bohr was working on his model of the hydrogen 
atom, H. G. J. Moseley measured the X-ray spectra of many elements. 
Moseley established that the square root of the frequency of the emitted 
lines is proportional to the atomic number Z (that he defined as the 
position of the atom in the periodic table, starting counting at Z = 1 


for hydrogen), i.e. 
V fees. 


Moseley’s original plot is shown in Fig. 1.2. As we shall see, this equation 
is a considerable simplification of the actual situation but it was remark- 
ably powerful at the time. By ordering the elements using Z rather than 
relative atomic mass, as was done previously, several inconsistencies in 
the periodic table were resolved. There were still gaps that were later 
filled by the discovery of new elements. In particular, for the rare-earth 
elements that have similar chemical properties and are therefore difficult 
to distinguish, it was said ‘in an afternoon, Moseley could solve the prob- 
lem that had baffled chemists for many decades and establish the true 
number of possible rare earths’ (Segré 1980). Moseley’s observations can 
be explained by a relatively simple model for atoms that extends Bohr’s 
model for hydrogen.!" 

A natural way to extend Bohr’s atomic model to heavier atoms is 
to suppose that the electrons fill up the allowed orbits starting from 
the bottom. Each energy level only has room for a certain number of 
electrons so they cannot all go into the lowest level and they arrange 
themselves in shells, labelled by the principal quantum number, around 
the nucleus. This shell structure arises because of the Pauli exclusion 
principle and the electron spin, but for now let us simply consider it as an 
empirical fact that the maximum number of electrons in the n = 1 shell 
is 2, the n = 2 shell has 8 and the n = 3 shell has 18, etc. For historical 
reasons, X-ray spectroscopists do not use the principal quantum number 
but label the shells by letters: K for n = 1, L for n = 2, M forn = 3 
and so on alphabetically.!? This concept of electronic shells explains the 
emission of X-rays from atoms in the following way. Moseley produced 
X-rays by bombarding samples of the given element with electrons that 


(1.20) 


11 Tragically, Henry Gwyn Jeffreys 
Moseley was killed when he was only 
28 while fighting in the First World War 
(see the biography by Heilbron (1974)). 


12The chemical properties of the ele- 
ments depend on this electronic struc- 
ture, e.g. the inert gases have full shells 
of electrons and these stable configura- 
tions are not willing to form chemical 
bonds. The explanation of the atomic 
structure underlying the periodic ta- 
ble is discussed further in Section 4.1. 
See also Atkins (1994) and Grant and 
Phillips (2001). 
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Tore handwriting in the bottom right 
corner states that this diagram is the 
original for Moseley’s famous paper in 
Phil. Mag., 27, 703 (1914). 
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Fig. 1.2 Moseley’s plot of the square root of the frequency of X-ray lines of elements 
against their atomic number. Moseley’s work established the atomic number Z as 
a more fundamental quantity than the ‘atomic weight’ (now called relative atomic 
mass). Following modern convention the units of the horizontal scales would be 
(108 Hz) at the bottom and (10~!° m) for the log scale at the top. (Archives of the 
Clarendon Laboratory, Oxford; also shown on the Oxford physics web site.)!3 
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had been accelerated to a high voltage in a vacuum tube. These fast 
electrons knock an electron out of an atom in the sample leaving a 
vacancy or hole in one of its shells. This allows an electron from a 
higher-lying shell to ‘fall down’ to fill this hole emitting radiation of a 
wavelength corresponding to the difference in energy between the shells. 

To explain Moseley’s observations quantitatively we need to modify 
the equations in Section 1.3, on Bohr’s theory, to account for the effect 
of a nucleus of charge greater than the +1e of the proton. For a nuclear 
charge Ze we replace e?/47€9 by Ze?/47€9 in all the equations, resulting 
in a formula for the energies like that of Balmer but multiplied by a factor 
of Z?. This dependence on the square of the atomic number means that, 
for all but the lightest elements, transitions between low-lying shells lead 
to emission of radiation in the X-ray region of the spectrum. Scaling the 
Bohr theory result is accurate for hydrogenic ions, i.e. systems with 
one electron around a nucleus of charge Ze. In neutral atoms the other 
electrons (that do not jump) are not simply passive spectators but partly 
screen the nuclear charge; for a given X-ray line, say the K- to L-shell 
transition, a more accurate formula is 


1 Z-ox)? (Z-01)? 
5 = Boo | Foy - Cour | (1.21) 


The screening factors ox and oy, are not entirely independent of Z and 
the values of these screening factors for each shell vary slightly (see the 
exercises at the end of this chapter). For large atomic numbers this 
formula tends to eqn 1.20 (see Exercise 1.4). This simple approach does 
not explain why the screening factor for a shell can exceed the number 
of electrons inside that shell, e.g. ox = 2 for Z = 74 although only 
one electron remains in this shell when a hole is formed. This does not 
make sense in a classical model with electrons orbiting around a nucleus, 
but can be explained by atomic wavefunctions—an electron with a high 
principal quantum number (and little angular momentum) has a finite 
probability of being found at small radial distances. 

The study of X-rays has developed into a whole field of its own within 
atomic physics, astrophysics and condensed matter, but there is only 
room to mention a few brief facts here. When an electron is removed 
from the K-shell the atom has an amount of energy equal to its bind- 
ing energy, i.e. a positive amount of energy, and it is therefore usual 
to draw the diagram with the K-shell at the top, as in Fig. 1.3. These 
are the energy levels of the hole in the electron shells. This diagram 
shows why the creation of a hole in a low-lying shell leads to a succes- 
sion of transitions as the hole works its way outwards through the shells. 
The hole (or equivalently the falling electron) can jump more than one 
shell at a time; each line in a series from a given shell is labelled using 
Greek letters (as in the series in hydrogen), e.g. K,, Kg,.... The levels 
drawn in Fig. 1.3 have some sub-structure and this leads to transitions 
with slightly different wavelengths, as shown in Moseley’s plot. This is 
fine structure caused by relativistic effects that we considered for Som- 
merfeld’s theory; the substitution e?/47e9 > Ze?/4me9, as above, (or 
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Fig. 1.3 The energy levels of the inner 
shells of the tungsten atom (Z = 74) 
and the transitions between them that 
give rise to X-rays. The level scheme 
has several important differences from 
that for the hydrogen atom (Fig. 1.1). 
Firstly, the energies are tens of keV, 
as compared to eV for Z = 1, be 
cause they scale as Z? (approximately). 
Secondly, the energy levels are plotted 
with n = 1 at the top because when 
an electron is removed from the K-shell 
the system has more energy than the 
neutral atom; energies are shown for 
an atom with a vacancy (missing elec- 
tron) in the K-, L-, M- and N-shells. 
The atom emits X-ray radiation when 
an electron drops down from a higher 
shell to fill a vacancy in a lower shell— 
this process is equivalent to the va- 
cancy, or hole, working its way out- 
wards. This way of plotting the ener- 
gies of the system shows clearly that 
the removal of an electron from the K- 
shell leads to a cascade of X-ray tran- 
sitions, e.g. a transition between the 
nm = 1 and 2 shells gives a line in the 
K-series which is followed by a line in 
another series (L-, M-, etc.). When the 
vacancy reaches the outermost shells of 
electrons that are only partially filled 
with valence electrons with binding en- 
ergies of a few eV (the O- and P-shells 
in the case of tungsten), the transition 
energies become negligible compared to 
those between the inner shells. This 
level scheme is typical for electrons in a 
moderately heavy atom, i.e. one with 
filled K-, L-, M- and N-shells. (The 
lines of the L-series shown dotted are 
allowed X-ray transitions, but they do 
not occur following Ka emission—for 
reasons explained in Section 2.2.) 
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equivalently a + Za) shows that fine structure is of order (Za)? times 
the gross structure, which itself is proportional to Z?. Thus relativistic 
effects grow as Z* and become very significant for the inner electrons of 
heavy atoms, leading to the fine structure of the L- and M-shells seen in 
Fig. 1.3. This relativistic splitting of the shells explains why in Mose- 
ley’s plot (Fig. 1.2) there are two closely-spaced curves for the Kg-line, 
and several curves for the L-series. 

Nowadays much of the X-ray work in atomic physics is carried out 
using sources such as synchrotrons; these devices accelerate electrons by 
the techniques used in particle accelerators. A beam of high-energy elec- 
trons circulates in a ring and the circular motion causes the electrons to 
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radiate X-rays. Such a source can be used to obtain an X-ray absorption 
spectrum.'* There are many other applications of X-ray emission, e.g. 
as a diagnostic tool for the processes that occur in plasmas in fusion 
research and in astrophysical objects. Many interesting processes occur 
at ‘high energies’ in atomic physics but the emphasis in this book is 
mainly on lower energies. 


1.6 Radiative decay 


An electric dipole moment —eD oscillating at angular frequency w radi- 
ates a power!® 
e? D?w4 


= ———_... IL 
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An electron in harmonic motion has a total energy!® of E = mew? D? /2, 
where D is the amplitude of the motion. This energy decreases at a rate 
equal to the power radiated: 

e7w? E 


dE 
ee ee 1.2 
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where the classical radiative lifetime 7 is given by 


1 See 


tT  Gregmec3 ” Wes 
For the transition in sodium at_a wavelength of ee (yellow light) 
this equation predicts a value of r = 16ns ~ 10-®s. This is very close 
to the experimentally measured value and typical of allowed transitions 
that emit visible light. Atomic lifetimes, however, vary over a very wide 
range,!’ e.g. for the Lyman-a transition (shown in Fig. 1.1) the upper 
level has a lifetime of only a few nanoseconds.'®:!9 
The classical value of the lifetime gives the fastest time in which the 
atom could decay on a given transition and this is often close to the 
observed lifetime for strong transitions. Atoms do not decay faster than 
a classical dipole radiating at the same wavelength, but they may decay 
more slowly (by many orders of magnitude in the case of forbidden 


transitions).”° 
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The development of the ideas of atomic structure was linked to_exper- 
iments_on the emission, and absorption, of radiation from atoms, e.g. 
X-rays or light. The emission of radiation was considered as something 
that just has to happen in order to carry away the energy when an elec-_ 
tron_jumps from one allowed orbit, to another, but the mechanism was 
not explained.”! In one of his many strokes of genius Einstein devised a 
way of treating the phenomenon of spontaneous emission quantitatively, 


14 absorption is easier to interpret than 
emission since only one of the terms 
in eqn 1.21 is important, e.g. EK = 
hcReo(Z = ox)?. 


15-This total power equals the integral 
of the Poynting vector over a closed sur- 
face in the far-field of radiation from the 
dipole. This is calculated from the os- 
cillating electric and magnetic fields in 
this region (see electromagnetism texts 
or Corney (2000)). 


16-The sum of the kinetic and potential 
energies, 


17The classical lifetime scales as w?. 
However, we will find that the quan- 
tum mechanical result is different (see 
Exercise 1.7). 


18 Higher-lying levels, e.g. n = 30, 
live for many microseconds (Gallagher 
1994). 


19 Atoms can be excited up to config- 
urations with high principal quantum 
numbers in laser experiments; such sys- 
tems are called Rydberg atoms and 
have small intervals between their en- 
ergy levels. As expected from the cor- 
respondence principle, these Rydberg 
atoms can be used in experiments that 
probe the interface between classical 
and quantum mechanics. 


20he ion-trapping - techniques de- 
scribed in Chapter 12 can probe tran- 
sitions with spontaneous decay rates 
less than 1s~1+, using single ions con- 
fined by electric and magnetic fields— 
something that was only a ‘thought 
experiment’ for Bohr and the other 
founders of quantum theory. In par- 
ticular, the effect of individual quan- 
tum jumps between atomic energy lev- 
els is observed. Radiative decay resem- 
bles radioactive decay in that individ- 
ual atoms spontaneously emit a photon 
at a given time but taking the average 
over an ensemble of atoms gives expo- 
nential decay. 


ae complete explanation of sponta- 
neous emission requires quantum elec- 
trodynamics. 
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22This treatment of the interaction of 
atoms with radiation forms the founda- 
tion for the theory of the laser, and is 
used whenever radiation interacts with 
matter (see Fox 2001). A historical ac- 
count of Einstein’s work and its pro- 
found implications can be found in Pais 
(1982). 


23-The frequency dependence of the in- 
teraction is considered in Chapter 7. 


*4The word laser is an acronym for light 


amplification by stimulated emission of 


radiation. 


25 This lifetime was estimated by a clas- 
sical argument in the previous section. 


Fig. 1.4 The interaction of a two-level 
atom with radiation leads to stimulated 
transitions, in addition to the sponta- 
neous decay of the upper level. 


based on an intuitive understanding of the process.”” 

Einstein considered atoms with two levels of energies, E, and Ep, as 
shown in Fig. 1.4; each level may have more than one state and the 
number of states wit the same _energy is the » degeneracy of that level 
represented by gy and go. Einstein considered what happens to an atom 
interacting with radiation of | energy density p(w ) pe per unit frequency in- 


at a a proportional to neon ‘where the constant of of proportionality 


is s Bin. >. The atom interacts strongly only wit only with that part_of the distri- 
bution p(w) with a frequency close to [0 wig = (E2 — F,) /h, the atom’s 
resonant wnt frequency. 23 By symmetry it is also expected that the radiation 
will cause transitions from the upper to lower levels at_a rate dependent 
on the energy density but with a constant of proportionality Bo, (the 
subscripts are in a different order for emission as compared to absorp- 
tion). This is a process of stimulated emission in which the radiation 
at angular frequency w causes the atom to emit_radiation of the same 
frequency. This increase in the amount of light at the incident frequency 
is fundamental to the operation of lasers.2* The symmetry between up 
and down is broken by the process of spontaneous emission in which an 
atom falls down to the lower level, even when no external radiation is 
present. Einstein introduced the coefficient Ag; to represent the rate of 
this process. Thus the rate equations for the populations of the levels, 
N, and No, are 


dN: 
a7 = N, Bi2p(12) — N2 Boi p(w12) — No Ao; (1.25) 
d 
7 oe 1.26) 
dt — dt o 


The first equation gives the rate of change of N2 in terms of the absorp- 
tion, stimulated emission and spontaneous emission, respectively. The 
second equation is a consequence of having only two levels so that atoms 
leaving level 2 must go into level 1; this is equivalent to a condition that 
N;, + No = constant. When p(w) = 0, and some atoms are initially in 
the upper level (N2(0) 4 0), the equations have a decaying exponential 
solution: 


No(t) = N2(0) exp (—Agit) ) (1.27) 
_where the mean lifetime ane is 
——— Az (1.28) 
Ex; g2; No 
Fy; 91; Mi 


Einstein devised a clever argument to find the relationship between the 
A»1- and B-coefficients and this allows a complete treatment of atoms in- 
teracting with radiation. Einstein imagined what would happen to such 
an atom in a region of black-body radiation, e.g. inside a box whose sur- 
face acts as a black body. The energy density of the radiation p(w) dw 
between angular frequency w and w +dw depends only on the tempera- 
ture T' of the emitting (and absorbing) surfaces of the box; this function 
is given t riven by the Planck distribution law:7° 


3 
p(w) = be : 


i ie exp(hw/kpT) —1° oo) 


Now we consider the revel populations of an atom in this black-body 
radiation. At equilibrium the rates of change of N, and Ng» (in eqn 1.26) 
are both zero and from eqn 1.25 we find that 


Agi 1 


Biz (Ni /N2)(Bi2/Bo) — 1 (1.30) 


p(wi2) = 


At thermal equilibrium the population in each of the states within the 
levels are given by the Boltzmann factor (the population in each state 
equals that of the energy level divided by its degeneracy): 


No _ Nt oy (Pe 
~ P\ kT) 


131 
ae (1.31) 
Combining the last three equations (1.29, 1.30 and 1.31) we find?’ 


hus? 
Aoi = mie Bo (1.32) 
and ee 
Bio mS 1 Bo . (1.33) 


The Einstein coefficients are properties of the atom.”® Therefore these 
relationships between them hold for any type of radiation, from narrow- 
bandwidth radiation from a laser to broadband light. Importantly, 
eqn 1.32 shows that strong absorption is associated with strong emission. 
Like many of the topics covered in this chapter, Einstein’s treatment cap- 
tured the essential features of the physics long before all the details of 
the quantum mechanics were fully understood.?? 


1.8 The Zeeman effect 


This introductory survey of early atomic physics must include Zeeman’s 
important work on the effect of a magnetic field on atoms. The obser- 
vation of what we now call the Zeeman effect and three other crucial 
experiments were carried out just at the end of the nineteenth century, 
and together these discoveries mark the watershed between classical and 
quantum physics.*° Before describing Zeeman’s work in detail, I shall 
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26 Planck was the first to consider radi- 
ation quantised into photons of energy 
hw. See Pais (1986). 


27 These equations hold for all T, so 
we can equate the parts that contain 
exp(hw/kpT) and the temperature- 
independent factors separately to ob- 
tain the two equations. 


28-This is shown explicitly in Chapter 7 
by a time-dependent perturbation the- 
ory calculation of Bjg. 


29To excite a significant fraction of the 
population into the upper level of a visi- 
ble transition would require black-body 
radiation with a temperature compara- 
ble to that of the sun, and this method 
is not generally used in practice—such 
transitions are easily excited in an elec- 
trical discharge where the electrons im- 
part energy to the outermost electrons 
in an atom. (The voltage required to 
excite weakly-bound outer electrons is 
much less than for X-ray production.) 


30 pais (1986) and Segré (1980) give his- 
torical accounts. 
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31This led to the measurement of the 
atomic X-ray spectra by Moseley de- 
scribed in Section 1.5. 


32 The field of nuclear physics was later 
developed by Rutherford, and others, 
to show that atoms have a very small 
dense nucleus that contains almost all 
the atomic mass. For much of atomic 
physics it is sufficient to think of the 
nucleus as a positive charge +Ze at the 
centre of the atoms. However, some un- 
derstanding of the size, shape and mag- 
netic moments of nuclei is necessary to 
explain the hyperfine structure and iso- 
tope shift. 


33-This is the same force that Thomson 
used to deflect free electrons in a curved 
trajectory to measure e/me. Nowadays 
such cathode ray tubes are commonly 
used in classroom demonstrations. 


briefly mention the other three great breakthroughs and their signifi- 
cance for atomic physics. Rontgen discovered mysterious X-rays emit- 
ted from discharges, and sparks, that could pass through matter and 
blacken photographic film.?4 At about the same time, Bequerel’s dis- 
covery of radioactivity opened up the whole field of nuclear physics.°? 
Another great breakthrough was J. J. Thomson’s demonstration that 
cathode rays in electrical discharge tubes are charged particles whose 
charge-to-mass ratio does not depend on the gas in the discharge tube. 
At almost the same time, the observation of the Zeeman effect of a mag- 
netic field showed that there are particles with the same charge-to-mass 
ratio in atoms (that we now call electrons). The idea that atoms con- 
tain electrons is very obvious now but at that time it was a crucial piece 
in the jigsaw of atomic structure that Bohr put together in his model. 
In addition to its historical significance, the Zeeman effect provides a 
very useful tool for examining the structure of atoms, as we shall see 
at several places in this book. Somewhat surprisingly, it is possible to 
explain this effect by a classical-mechanics line of reasoning (in certain 
special cases). An atom in a magnetic field can be modelled as a simple 
harmonic oscillator. The restoring force on the electron is the same for 
displacements in all directions and the oscillator has the same resonant 
frequency wo for motion along the z-, y- and z-directions (when there is 
no magnetic field). In a magnetic field B the equation of motion for an 
electron with charge —e, position r and velocity v = r is 


¥ 2 
Me—— = =a, T — ev Gis. 


= (1.34) 


In addition to the restoring force (assumed to exist without further ex- 
planation), there is the Lorentz force that occurs for a charged particle 
moving through a magnetic field.?* Taking the direction of the field to 
be the z-axis, B = Be, leads to 


r+ 20.r x @, +wer =0. (1.35) 
This contains the Larmor frequency 
eB 
Wit : : 
L Ome (1.36) 


We use a matrix method to solve the equation and look for a solution 
in the form of a vector oscillating at w: 


i 
r= Re y | exp (—iwt) (1.3%) 
e 
Written in matrix form, eqn 1.35 reads 
we iw, 0 L 3 
Zw, we 0 (Nae - (1.38) 


0 0 we z z 


The eigenvalues w? are found from the following determinant: 


we —w* iw, 0 
Zw, we — w? 0 =e 
0 0 we — Ww 


(1.39) 


This gives {w* — (2wé + 4?) w? + wi} (w? — w2) = 0. The solution 
W = Wo is obvious by inspection. The other two eigenvalues can be found 
exactly by solving the quadratic equation for w? inside the curly brackets. 
For an optical transition we always have Q, < wo so the approximate 
eigenfrequencies are w ~ wo + Ny. Substituting these values back into 
eqn 1.38 gives the eigenvectors corresponding to w = wo — Ny, wo and 
wo + OL, respectively, as 


cos (wo — OQ) t 0 
r=] —sin(w—OzL)t |, 0 
0 COS Wot 


cos (Wo + OL) t 
sin (Wo + Nz) t 
0 


and 


The magnetic field does not affect motion along the z-axis and the angu- 
lar frequency of the oscillation remains wo. Interaction with the magnetic 
field causes the motions in the z- and y-directions to be coupled together 
(by the off-diagonal elements +2iwy of the matrix in eqn 1.38).°4 The 
result is two circular motions in opposite directions in the ry-plane, as 
illustrated in Fig. 1.5. These circular motions have frequencies shifted 
up, or down, from wo by the Larmor frequency. Thus the action of the 
external field splits the original oscillation at a single frequency (actu- 
ally three independent oscillations all with the same frequency, wo) into 
three separate frequencies. An oscillating electron acts as a classical 
dipole that radiates electromagnetic waves and Zeeman observed the 
frequency splitting Qy in the light emitted by the atom. 

This classical model of the Zeeman effect explains the polarization 
of the light, as well as the splitting of the lines into three components. 
The calculation of the polarization of the radiation at each of the three 
different frequencies for a general direction of observation is straight- 
forward using vectors;*” however, only the particular cases where the 
radiation propagates parallel and perpendicular to the magnetic field 
are considered here, i.e. the longitudinal and transverse directions of 
observation, respectively. An electron oscillating parallel to B radiates 
an electromagnetic wave with linear polarization and angular frequency 
Wo. This z-component of the line is observed in all directions except 
along the magnetic field;?® in the special case of transverse observation 
(i.e. in the xy-plane) the polarization of the 7-component lies along 
e,. The circular motion of the oscillating electron in the xry-plane at 
angular frequencies wo + Q, and wo — Qy produces radiation at these 
frequencies. Looking transversely, this circular motion is seen edge-on 
so that it looks like linear sinusoidal motion, e.g. for observation along 
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Fig. 1.5 A simple model of an atom 
as an electron that undergoes simple 
harmonic motion explains the features 
of the normal Zeeman effect of a mag- 
netic field (along the z-axis). The 
three eigenvectors of the motion are: 
€, coswot and cos({wo NL} t)e@z + 
sin ({wo + Ny} t) ey. 


34The matrix does not have off- 
diagonal elements in the last column 
or bottom row, so the z- and y- 
components are not coupled to the z- 
component, and the problem effectively 
reduces to solving a 2 x 2 matrix. 


35 Some further details are given in Sec- 
tion 2.2 and in Woodgate (1980). 


36 An oscillating electric dipole pro- 
portional to @, coswot does not radi- 
ate along the z-axis—observation along 
this direction gives a view along the 
axis of the dipole so that effectively the 
motion of the electron cannot be seen. 
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Fig. 1.6 For the normal Zeeman effect a simple model of an atom (as in Fig. 1.5) explains the frequency of the light emitted 
and its polarization (indicated by the arrows for the cases of transverse and longitudinal observation). 


37 This is left-circularly-polarized light 
(Corney 2000). 


the z-axis only the y-component is seen, and the radiation is linearly 
polarized perpendicular to the magnetic field—see Fig. 1.6. These are 
called the o-components and, in contrast to the 7-component, they are 
also seen in longitudinal observation—looking along the z-axis one sees 
the electron’s circular motion and hence light that has circular polariza- 
tion. Looking in the opposite direction to the magnetic field (from the 
positive z-direction, or 9 = 0 in polar coordinates) the circular motion 
in the anticlockwise direction is associated with the frequency wo +Qy.°" 
In addition to showing that atoms contain electrons by measuring the 
magnitude of the charge-to-mass ratio e/m,., Zeeman also deduced the 
sign of the charge by considering the polarization of the emitted light. 
If the sign of the charge was not negative, as we assumed from the start, 
light at wo +Q, would have the opposite handedness—from this Zeeman 
could deduce the sign of the electron’s charge. 

For situations that only involve orbital angular momentum (and no 
spin) the predictions of this classical model correspond exactly to those 
of quantum mechanics (including the correct polarizations), and the in- 
tuition gained from this model gives useful guidance in more complicated 
cases. Another reason for studying the classical treatment of the Zee- 
man effect is that it furnishes an example of degenerate perturbation 
theory in classical mechanics. We shall encounter degenerate perturba- 
tion theory in quantum mechanics in several places in this book and an 


understanding of the analogous procedure in classical mechanics is very 
helpful. 


1.8.1 Experimental observation of the Zeeman 
effect 


Figure 1.7(a) shows an apparatus suitable for the experimental observa- 
tion of the Zeeman effect and Fig. 1.7(b-e) shows some typical experi- 
mental traces. A low-pressure discharge lamp that contains the atom to 
be studied (e.g. helium or cadmium) is placed between the pole pieces 
of an electromagnet capable of producing fields of up to about 1T. In 
the arrangement shown, a lens collects light emitted perpendicular to 
the field (transverse observation) and sends it through a Fabry—Perot 
étalon. The operation of such étalons is described in detail by Brooker 
(2003), and only a brief outline of the principle of operation is given 
here. 


(a) 


({) Gas inlet 


Electromagnet | 


To vacuum pump 


(b) (c) 
1.0 1.0 
0.5 0.5 

: 0.0 
0.0 ; 

(d) (e) 
1.0 1.0 
0.5 0.5 

0.0 
0.0 ; 


1.8 The Zeeman effect 17 


Fig. 1.7 (a) An apparatus suitable 
for the observation of the Zeeman ef- 
fect. The light emitted from a dis- 
charge lamp, between the pole pieces 
of the electromagnet, passes through 
a narrow-band filter and a Fabry- 
Perot étalon. Key: L1, L2 are lenses; 
F-filter; P-—polarizer to discriminate 
between z- and o-polarizations (op- 
tional); Fabry-Perot étalon made of 
a rigid spacer between two highly- 
reflecting mirrors (M1 and M2); D- 
detector. Other details can be found in 
Brooker (2003). A suitable procedure is 
to (partially) evacuate the étalon cham- 
ber and then allow air (or a gas with a 
higher refractive index such as carbon 
dioxide) to leak in through a constant- 
flow-rate valve to give a smooth linear 
scan. Plots (b) to (e) show the inten- 
sity I of light transmitted through the 
Fabry-Perot étalon. (b) A scan over 
two free-spectral ranges with no mag- 
netic field. Both (c) and (d) show a Zee- 
man pattern observed perpendicular to 
the applied field; the spacing between 
the 7- and o-components in these scans 
is one-quarter and one-third of the free- 
spectral range, respectively—the mag- 
netic field in scan (c) is weaker than 
in (d). (e) In longitudinal observation 
only the o-components are observed— 
this scan is for the same field as in (c) 
and the o-components have the same 
position in both traces. 


18 Early atomic physics 


357 equals the potential energy of the 
electron in the first Bohr orbit. 


39~This Larmor frequency equals the 
splitting between the a- and o- 
components in the normal Zeeman ef- 
fect. 


e Light from the lamp is collected by .a lens and directed on to an 
interference filter that transmits only a narrow band of wavelengths 
corresponding to a single spectral line. 


e The étalon produces an interference pattern that has the form of con- 
centric rings. These rings are observed on a screen in the focal plane 
of the lens placed after the étalon. A small hole in the screen is po- 
sitioned at the centre of the pattern so that light in the region of the 
central fringe falls on a detector, e.g. a photodiode. (Alternatively, 
the lens and screen can be replaced by a camera that records the ring 
pattern on film.) 


e The effective optical path length between the two flat highly-reflecting 
mirrors is altered by changing the pressure of the air in the cham- 
ber; this scans the étalon over several free-spectral ranges while the 
intensity of the interference fringes is recorded to give traces as in 
Fig. 1.7(b-e). 


1.9 Summary of atomic units 


This chapter has used classical mechanics and elementary quantum ideas 
to introduce the important scales in atomic physics: the unit of length 
ao and a unit of energy hcR... The natural unit of energy is e?/47€9a9 
and this unit is called a hartree.*® This book, however, expresses energy 
in terms of the energy equivalent to the Rydberg constant, 13.6eV; this 
equals the binding energy in the first Bohr orbit of hydrogen, or 1/2 a 
hartree. These quantities have the following values: 


he? lil 
ag = (e2/4m€9) me = 5.29 x 10 m, (1.40) 
e(€7/4r¢0)” 
hcRoo = cola) =iBGey (1.41) 


The use of these atomic units makes the calculation of other quantities 
simple, e.g. the electric field in a hydrogen atom at radius r = ag equals 
e/(Ame9aé). This corresponds to a potential difference of 27.2 V over a 
distance of ao, or a field of 5 x 10'! Vm7—!. 

Relativistic effects depend on the dimensionless fine-structure con- 


stant a: 214 ) 
e*/47e 1 
Ae: (1.42) 

he lis i 
The Zeeman effect of a magnetic field on atoms leads to a frequency shift 
of Ny, in eqn 1.36.°° In practical units the size of this frequency shift is 

Q, — (3 

2nB 4am. 
Equating the magnetic energy hOQ, with wpB, the magnitude of the 


energy for a magnetic moment jip in a magnetic flux density B, shows 
that the unit of atomic magnetic moment is the Bohr magneton 


eh 
aif 


=MGhzie. (1.43) 


ie = =e 102 cele ~ (1.44) 


This magnetic moment depends on the properties of the unpaired elec- 
tron (or electrons) in the atom, and has a similar magnitude for all 
atoms. In contrast, other atomic properties scale rapidly with the nu- 
clear charge; hydrogenic systems have energies proportional to Z?, and 
the same reasoning shows that their size is proportional to 1/Z (see 
eqns 1.40 and 1.41). For example, hydrogenic uranium U*®! has been 
produced in accelerators by stripping 91 electrons off a uranium atom 
to leave a single electron that has a binding energy of 92? x 13.6eV = 
115keV (for n = 1) and an orbit of radius aj9/92 = 5.75 x 107? m= 
575fm. The transitions between the lowest energy levels of this system 
have short wavelengths in the X-ray region.*° 

The reader might think that it would be a good idea to use the same 
units across the whole of atomic physics. In practice, however, the units 
reflect the actual experimental techniques used in a particular region 
of the spectrum, e.g. radio-frequency, or microwave synthesisers, are 
calibrated in Hz (kHz, MHz and GHz); the equation for the angle of 
diffraction from a grating is expressed in terms of a wavelength; and 
for X-rays produced by tubes in which electrons are accelerated by high 
voltages it is natural to use keV.*! A table of useful conversion factors 
is given inside the back cover. 

The survey of classical ideas in this chapter gives a historical perspec- 
tive on the origins of atomic physics but it is not necessary, or indeed 
in some cases downright confusing, to go through a detailed classical 
treatment—the physics at the scale of atomic systems can only properly 
be described by wave mechanics and this is the approach used in the 
following chapters.*? 
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40 Pnergies can be expressed in terms 
of the rest mass energy of the electron 
Mec? = 0.511 MeV. The gross energy 
is (Za)? 5mec? and the fine structure 
is of order (Za)? Mec’. 


417 aser techniques can measure tran- 
sition frequencies of around 10!° Hz 
directly as a frequency to determine 
a precise value of the Rydberg con- 
stant, and there are no definite rules for 
whether a transition should be specified 
by its energy, wavelength or frequency. 


42x ray spectra are not discussed again 
in this book and further details can be 
found in Kuhn (1969) and other atomic 
physics texts. 
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Exercises 


(1.1) Isotope shift 
The deuteron has approximately twice the mass of 
the proton. Calculate the difference in the wave- 
length of the Balmer-a line in hydrogen and deu- 
terium. 


(1.2) The energy levels of one-electron atoms 

The table gives the wavelength** of lines observed 
in the spectrum of atomic hydrogen and singly- 
ionized helium. Explain as fully as possible the 
similarities and differences between the two spec- 


tra. 


H(nm) He* (nm) 
656.28 656.01 
486.13 541.16 
434.05 485.93 
410.17 454.16 
433.87 
419.99 
410.00 


43These are the wavelengths in air with a refractive index of 1.0003 in the visible region. 
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(1.3) 


(1.4) 


(1.5) 


(1.6) 


(1.7) 


(1.9) 


(1.10) 


Relativistic effects 

Evaluate the magnitude of relativistic effects in 
the n = 2 level of hydrogen. What is the resolv- 
ing power A/(AX) min of an instrument that could 
observe these effects in the Balmer-a line? 


X-rays 

Show that eqn 1.21 approximates to eqn 1.20 when 
the atomic number Z is much greater than the 
screening factors. 


X-rays 

It is suspected that manganese (Z = 25) is very 
poorly mixed with iron (Z = 26) in a block of al- 
loy. Predict the energies of the K-absorption edges 
of these elements and determine an X-ray photon 
energy that would give good contrast (between re- 
gions of different concentrations) in an X-ray of 
the block. 


X-ray experiments 

Sketch an apparatus suitable for X-ray spectro- 
scopy of elements, e.g. Moseley’s experiment. 
Describe the principle of its operation and the 
method of measuring the energy, or wavelength, 
of X-rays. 


Fine structure in X-ray transitions 

Estimate the magnitude of the relativistic effects 
in the M-shell of lead (Z = 82) in keV. Also ex- 
press you answer as a fraction of the K, transition. 


Radiative lifetime 

For an electron in a circular orbit of radius r 
the electric dipole moment has a magnitude of 
D = —er and radiates energy at a rate given by 
eqn 1.22. Find the time 7 taken to lose an energy 
of hw. 

Use your expression to estimate 7 for the n = 3 
to n = 2 transition in hydrogen that emits light of 
wavelength 656 nm. 


Comment. This method gives 1/7 « (er)? w?, 
which corresponds closely to the quantum mechan- 
ical result in eqn 7.23. 


Black-body radiation 

Two-level atoms with a transition at wavelength 
A = 600 nm, between the levels with degeneracies 
g: = 1 and gz = 3, are immersed in black-body 
radiation. The fraction in the excited state is 0.1. 
What is the temperature of the black body and the 
energy density per unit frequency interval p(wi2) 
of the radiation at the transition frequency? 


Zeeman, effect 
What is the magnitude of the Zeeman shift for an 
atom in (a) the Earth’s magnetic field, and (b) a 


(1.11) 


(1.12) 


magnetic flux density of 1 T? Express your answers 
in both MHz, and as a fraction of the transition 
frequency Af/f for a spectral line in the visible. 


Relative intensities in the Zeeman effect 

Without an external field, an atom has no pre- 
ferred direction and the choice of quantisation axis 
is arbitrary. In these circumstances the light emit- 
ted cannot be polarized (since this would establish 
a preferred orientation). As a magnetic field is 
gradually turned on we do not expect the intensi- 
ties of the different Zeeman components to change 
discontinuously because the field has little effect 
on transition rates. This physical argument im- 
plies that oppositely-polarized components emit- 
ted along the direction of the field must have equal 
intensities, i.e. J,4 = I,- (notation defined in 
Fig. 1.6). What can you deduce about 


(a) the relative intensities of the components 
emitted perpendicularly to the field? 


(b) the ratio of the total intensities of light emit- 
ted along and perpendicular to the field? 


Bohr theory and the correspondence principle 

This exercise gives an alternative approach to the 
theory of the hydrogen atom presented in Sec- 
tion 1.3 that is close to the spirit of Bohr’s original 
papers. It is somewhat more subtle than that usu- 
ally given in elementary textbooks and illustrates 
Bohr’s great intuition. Rather than the ad hoc as- 
sumption that angular momentum is an integral 
multiple of A (in eqn 1.7), Bohr used the corre- 
spondence principle. This principle relates the be- 
haviour of a system according to the known laws 
of classical mechanics and its quantum properties. 


Assumption II The correspondence principle 
states that in the limit of large quantum numbers 
a quantum system tends to the same limit as the 
corresponding classical system. 


Bohr formulated this principle in the early days 
of quantum theory. To apply this principle to hy- 
drogen we first calculate the energy gap between 
adjacent electron orbits of radii r and r’. For large 
radii, the change Ar =r’ —r <r. 


(a) Show that the angular frequency w = AE/h 
of radiation emitted when an electron makes 
a quantum jump between these levels is 


ie e*/4m€o Ar 
~ Qh 7? 


(b) An electron moving in a circle of radius r acts 
as an electric dipole radiating energy at the 


(d) 


orbital frequency w given by eqn 1.4. Verify 
that this equation follows from eqn 1.3. 


In the limit of large quantum numbers, the 
quantum mechanical and classical expressions 
give the same frequency w. Show that equat- 
ing the expressions in the previous parts yields 
Ar =2(aor)'””. 

The difference in the radii between two ad- 
jacent orbits can be expressed as a difference 
equation.** In this case An = 1 and 


ne (1.45) 
This equation can be solved by assuming that 
the radius varies as some power x of the quan- 
tum number n, e.g. if one orbit is labelled 
by an integer n and the next by n+ 1, then 
r = an™ and r’ = a(n+1)*. Show that 
Ar = arn*~! x n*/?. Determine the power 
and the constant a. 


(1.13) 


Exercises for Chapter 1 21 


Comment. We have found eqn 1.8 from the cor- 
respondence principle without considering angular 
momentum. The allowed energy levels are easily 
found from this equation as in Section 1.3. The re- 
markable feature is that, although the form of the 
equation was derived for high values of the prin- 
cipal quantum number, the result works down to 
Sk 


Rydberg atoms 


(a) 


(b) 
(c) 


Show that the energy of the transitions be- 
tween two shells with principal quantum num- 
bers n and n’ = n+1 is proportional to 1/n° 
for large n. 


Calculate the frequency of the transition be- 
tween the n’ = 51 and n = 50 shells. 


What is the size of an atom in these Rydberg 
states? Express your answer both in atomic 
units and in metres. 


44.4 difference equation is akin to a differential equation but without letting the differences become infinitesimal. 
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1 The emphasis is on the properties of 
the wavefunctions rather than how to 
solve differential equations. 


*The operator for linear momentum is 
p = —ihV and for angular momentum 
itishl=r x p. This notation differs in 
two ways from that commonly used in 
quantum texts. Firstly, h is taken out- 
side the angular momentum operators, 
and secondly, the operators are written 
without ‘hats’. This is convenient for 
atomic physics, e.g. in the vector model 
for the addition of angular momenta. 


The hydrogen atom 


The simple hydrogen atom has had a great influence on the development 
of quantum theory, particularly in the first half of the twentieth century 
when the foundations of quantum mechanics were laid. As measurement 
techniques improved, finer and finer details were resolved in the spec- 
trum of hydrogen until eventually splittings of the lines were observed 
that cannot be explained even by the fully relativistic formulation of 
quantum mechanics, but require the more advanced theory of quantum 
electrodynamics. In the first chapter we looked at the Bohr-Sommerfeld 
theory of hydrogen that treated the electron orbits classically and im- 
posed quantisation rules upon them. This theory accounted for many of 
the features of hydrogen but it fails to provide a realistic description of 
systems with more than one electron, e.g. the helium atom. Although 
the simple picture of electrons orbiting the nucleus, like planets round 
the sun, can explain some phenomena, it has been superseded by the 
Schrodinger equation and wavefunctions. This chapter outlines the ap- 
plication of this approach to solve Schrédinger’s equation for the hydro- 
gen atom; this leads to the same energy levels as the Bohr model but 
the wavefunctions give much more information, e.g. they allow the rates 
of the transitions between levels to be calculated (see Chapter 7). This 
chapter also shows how the perturbations caused by relativistic effects 
lead to fine structure. 


2.1 The Schrodinger equation 


The solution of the Schrédinger equation for a Coulomb potential is 
in every quantum mechanics textbook and only a brief outline is given 
here.’ The Schrédinger equation for an electron of mass me in a 
spherically-symmetric potential is 


{=v +Vvin)} y= Ey. 


ae (2.1) 


This is the quantum mechanical counterpart of the classical equation 
for the conservation of total energy expressed as the sum of kinetic and 
potential energies.” In spherical polar coordinates we have 


(2.2) 


where the operator 1° contains the terms that depend on 6 and ¢, namely 


p=-{ yee 

= sin @ 06 06 sin? 9 Od? J ’ 
and h?1? is the operator for the orbital angular momentum squared. Fol- 
lowing the usual procedure for solving partial differential equations, we 


look for a solution in the form of a product of functions 7 = R(r)Y (6, ¢). 
The equation separates into radial and angular parts as follows: 


ago, © =) _ 2mer* 


C3) 


1, 
Each side depends on different variables and so the equation is only 
satisfied if both sides equal a constant that we call 6. Thus 

PY =bY. (275) 
This is an eigenvalue equation and we shall use the quantum theory of 
angular momentum operators to determine the eigenfunctions Y (6, ¢). 


2.1.1 Solution of the angular equation 


To continue the separation of variables we substitute Y = O(6)®(¢) into 
eqn 2.5 to obtain 


sindd (. , 00 eye 
“© 80 (sine a) + bsin i 


_1e6 
b Oe? 


= const. (2.6) 


The equation for @(¢) is the same as in simple harmonic motion, so® 


© = Act? 4 Bese (2.7) 
The constant on the right-hand side of eqn 2.6 has the value m?. Phys- 
ically realistic wavefunctions have a unique value at each point and this 
imposes the condition ®(¢ + 27) = ®(¢), so m must be an integer. 
The function ®(¢) is the sum of eigenfunctions of the operator for the 
z-component of orbital angular momentum 


(2.8) 


The function e’’"? has magnetic quantum number m and its complex 
conjugate ei”? has magnetic quantum number —m.* 

A convenient way to find the function Y (6, @) and its eigenvalue b in 
eqn 2.5° is to use the ladder operators 1, = /, + ily and IL = l, — ily. 
These operators commute with I’, the operator for the total angular 
momentum squared (because J, and l, commute with 1°); therefore, the 
three functions Y, 1,Y and /_Y are all eigenfunctions of I? with the 
same eigenvalue b (if they are non-zero, as discussed below). The ladder 
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34 and B are arbitrary constants. 
Alternatively, the solutions can be 
written in terms of real functions as 
A’ sin(m@) + B’ cos(m@). 


4The operator —02/0¢2 = I2 and con- 
sequently ©(¢) is an eigenfunction of J? 
with eigenvalue m?. 


©The solution of equations involving 
the angular part of V? arises in many 
situations with spherical symmetry, e.g. 
in électrostatics, and the same mathe- 
matical tools could be used here to de- 
termine the properties of the spherical 
harmonic functions, but angular mo- 
mentum methods give more physical in- 
sight for atoms. 
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6 The raising operator contains the fac- 
tor e'?, so that when it acts on an eigen- 
function of the form Y « O(@)ei™”? 
the resulting function [,Y contains 
ei(m+1)¢, The 6-dependent part of this 
function is found below. 


7 These properties follow from the com- 
mutation relations for angular momen- 
tum operators (see Exercise 2.1). 


8This statement can be proved rigor- 
ously using angular momentum opera- 
tors, as shown in quantum mechanics 
texts. 


®The dubious reader can easily check 
that 14Y;,; = 0. It is trivially obvious 
that 12 ¥;,; =1Y,,1, where m = | for this 
function. 


ou eg ee = 0 and m = —1 is the low- 
est eigenvalue of /,. Proportional signs 
have been used to avoid worrying about 
normalisation; this leaves an ambiguity 
about the relative phases of the eigen- 
functions but we shall choose them in 
accordance with usual convention. 

11The relation Yj,-m = Y;",, shows 
that, if mmax = l, then mmin = —l. 
Between these two extremes there are 
21+ 1 possible values of the magnetic 
quantum number m for each J. Note 
that the orbital angular momentum 
quantum number | is not the same as 
the length of the angular momentum 
vector (in units of h). Quantum me- 
chanics tells us only that the expecta- 
tion value of the square of the orbital 
angular momentum is /(J+ 1), in units 
of h?. The length itself does not have a 
well-defined value in quantum mechan- 
ics and it does not make sense to re- 
fer to it. When people say that an 
atom has ‘orbital angular momentum 
of one, two, etc.’, strictly speaking they 
mean that the orbital angular momen- 
tum quantum number | is 1, 2, etc. 


operators can be expressed in polar coordinates as: 
: 0 6) 
iL, =e? | — + icotd— ], 
a (5 ad 


= Oo O 
—id St eon 
: ‘i ( fale) as tox). 


The operator 1, transforms a function with magnetic quantum number 
m into another angular momentum eigenfunction that has eigenvalue 
m+1. Thus 1, is called the raising operator. The lowering operator J_ 
changes the magnetic quantum number in the other direction, m + m— 
1. It is straightforward to prove these statements and other properties 
of these operators;’ however, the purpose of this section is not to present 
the general theory of angular momentum but simply to outline how to 
find the eigenfunctions (of the angular part) of the Schrodinger equation. 

Repeated application of the raising operator does not increase m 
indefinitely—for each eigenvalue b there is a maximum value of the mag- 
netic quantum number® that we shall call 1, i.e. mMmax = /. The raising 
operator acting on an eigenfunction with Mmax gives zero since by def- 
inition there are no eigenfunctions with m > Mmax. Thus solving the 
equation 1, Y = 0 (Exercise 2.11) we find that the eigenfunctions with 
Mmax = | have the form 


(2.9) 


Y x sin'Ge!?. (2.10) 


Substitution back into eqn 2.5 shows that these are eigenfunctions I? with 
eigenvalue b = I(1+ 1), and / is the orbital angular momentum quantum 
number. The functions Y;,(0,@) are labelled by their eigenvalues in the 
conventional way.’ For | = 0 only m = 0 exists and Yoo is a constant 
with no angular dependence. For / = 1 we can find the eigenfunctions 
by starting from the one with 1 = 1 = m (in eqn 2.10) and using the 
lowering operator to find the others: 


. i 
Ya x sin Be’? , 
Yoo! 7 cccosd. 


Yai X 1 Yaoi sinfe i? . 


This gives all three eigenfunctions expected for | = 1.1° For 1 = 2 this 
procedure gives 
Yoo « sin? Gei2¢ , 


Yo,-2 x sin? @e7i2¢ , 


These are the five eigenfunctions with m = 2, 1,0, --1, —2.1! Normalised 
angular functions are given in Table 2.1. 
Any angular momentum eigenstate can be found from eqn 2.10 by 


Table 2.1 Orbital angular momentum eigenfunctions. 


1 
ag = = 
Ava = = cos 6 
3 . 
Ce = sin 6 e*'? 
Voce 5 
2,0 = i Te (3 cos” 6—1) 


+i¢ 


1 
Yo41 = + = sin cos 6 ¢ 


Wi io) = a; aes sin 26 ae 


20 
Normalisation: i de [Yim|7 sin@d@d¢ = 1 
0 0 


repeated application of the lowering operator:!? 
Wood.) sin’ Ge"? (2) 


To understand the properties of atoms, it is important to know what 
the wavefunctions look like. The angular distribution needs to be mul- 
tiplied by the radial distribution, calculated in the next section, to give 
the square of the wavefunction as 


| (7,0, 6)/° = R24 (r) [Yim (6,4)? « (2.12) 


This is the probability distribution of the electron, or —e {b|? can be in- 
terpreted as the electronic charge distribution. Many atomic properties, 
however, depend mainly on the form of the angular distribution and 
Fig. 2.1 shows some plots of |¥j,m|’. The function |Yo, o|” is spherically 
symmetric. The function |Y;,9|” has two lobes along the z-axis. The 
squared modulus of the other two eigenfunctions of | = 1 is proportional 
to sin? @. As shown in Fig. 2.1(c), there is a correspondence between 
these distributions and the circular motion of the electron around the 
z-axis that we found as the normal modes in the classical theory of the 
Zeeman effect (in Chapter 1).1° This can be seen in Cartesian coordi- 
nates where 


x z 
1,0) CS = 5 
tf 


eee - ao (2.13) 


Toate) 


vied x 


2.1 The Schrodinger equation 25 


12-This eigenfunction has magnetic 
quantum number | — (J —m) = 


13 Stationary states in quantum 
mechanics correspond to the time- 
averaged classical motion. In this 
case both directions of circular mo- 
tion about the z-axis give the same 
distribution. 
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Fig. 2.1 Polar plots of the squared modulus of the angular wavefunctions for the hydrogen atom with | = 0 and 1. For each 
value of the polar angle @ a point is plotted at a distance proportional to |Y(6,¢)|? from the origin. Except for (d), the plots 
have rotational symmetry about the z-axis and look the same for any value of ¢. (a) |Yo,o|” is spherical. (b) [1,0]? « cos? 6 
has two lobes along the z-axis. (c) |Y¥i,1|? « sin? @ has an ‘almost’ toroidal shape—this function equals zero for @ = 0. (|Y1,~1 le 


looks the same.) (d) |¥1,1 + bea od \a/r|? has rotational symmetry about the z-axis and this polar plot is drawn for ¢ = 0; 
it looks like (b) but rotated through an angle of 7/2. (e) |Y2,2|? « sin* 6. 


Any linear combination of these is also an eigenfunction of I’, e.g. 


Yi,-1 — Yi «x — =sin9@ cos¢, (2.14) 


Y,,-1+ Yi, « = =sin@ sing. (2.15) 


sjc 3/8 


147, the absence of an external field 


to break the spherical symmetry, all Th : r 
: none ese two real functions have the san 
axes are equivalent, i.e. the atom does Covhowseline shape sae Yio wi 2/r but are 


not Malveea preferredédirection*sothere aligned along the x- and y-axes, respectively.'* In chemistry these dis- 
is symmetry between the z-, y- and tributions for | = 1 are referred to as p-orbitals. Computer programs 
z-directions. In an external magnetic can produce plots of such functions from any desired viewing angle (see 
field the states with different values of Blundell 2001, Fig. 3.1) that are helpful i : ieeoadaat : th 
F< ic Aaa. | net: ASTRA PBiges: t are helpful in visualising the functions wit 
aWelesollinearconibinetionswefthemare |! > 1. (Fort = Ovand Warcross*setrion* of themtietions nga pla eat 
not eigenstates of the system. contains the symmetry axis suffices.) 


2.1.2 Solution of the radial equation 


An equation for R(r) is obtained by setting eqn 2.4 equal to the constant 
b = I(l +1) and putting in the Coulomb potential V(r) = —e?/4zeor. It 


can be cast in a convenient form by the substitution P(r) = rR (r): 


le der A? 1(l+1)  e?/4re9 
~Smea? + Sc eB} P =o. (2.16) 


The term proportional to (1 + 1)/r? is the kinetic energy associated 
with the angular degrees of freedom; it appears in this radial equation 
as an effective potential that tends to keep wavefunctions with | 4 0 
away from the origin. Dividing through this equation by E = —|E| 
(a negative quantity since E < 0 for a bound state) and making the 


substitution 
2 2me|E|r? 


i (2.17) 
reduces the equation to the dimensionless form 
a’°P Id+1) A 
ae t {oe tt nilpno. (2.18) 


The constant that characterises the Coulomb interaction strength is 


e | 2m. 


The standard method of solving such differential equations is to look for 
a solution in the form of a series. The series solutions have a finite num- 
ber of terms and do not diverge when \ = 2n, where n is an integer.° 
Thus, from eqn 2.19, these wavefunctions have eigenenergies given by!® 


Qme (e2/4m€9)° 1 

E= eet) = —heRoo= - (2.20) 
This shows that the Schrédinger equation has stationary solutions at en- 
ergies given by the Bohr formula. The energy does not depend on J; this 
accidental degeneracy of wavefunctions with different / is a special fea- 
ture of Coulomb potential. In contrast, degeneracy with respect to the 
magnetic quantum number m, arises because of the system’s symmetry, 
i.e. an atom’s properties are independent of its orientation in space, in 
the absence of external fields.‘ The solution of the Schrédinger equation 
gives much more information than just the energies; from the wavefunc- 
tions we can calculate other atomic properties in ways that were not 
possible in the Bohr-Sommerfeld theory. 

We have not gone through the gory details of the series solution, but 
we should examine a few examples of radial wavefunctions (see Ta- 
ble 2.2). Although the energy depends only on n, the shape of the 
wavefunctions depends on both n and / and these two quantum num- 
bers are used to label the radial functions R,1(r). For n = 1 there is 
only the / = 0 solution, namely Rio «x e ’. For n = 2 the orbital 
angular momentum quantum number is / = 0 or 1, giving 


igen CS (ie) mae 
Ro x pe ©. 
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15The solution has the general form 
P(p) = Ce~?v(p), where v(p) is an- 
other function of the radial coordinate, 
for which there is a polynomial solution 
(see Woodgate 1980 and Rae 1992). 


16 Using eqn 1.41. 


I7This is true for any spherically- 
symmetric potential V(r). 
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Table 2.2 Radial hydrogenic wavefunctions R,,; in terms of the variable p = 
Zr /(nag), which gives a scaling that varies with n. The Bohr radius ao is defined in 
eqn 1.40. 


3/2 
AV/2 il = 
3 °F (2 ~ +p) : 


These show a general a feature of hydrogenic wavefunctions, namely 
that the radial functions for / = 0 have a finite value at the origin, i.e. 
the power series in p starts at the zeroth power. Thus electrons with 
I = 0 (called s-electrons) have a finite probability of being found at the 
position of the nucleus and this has important consequences in atomic 
physics. 

Inserting |E| from eqn 2.20 into eqn 2.17 gives the scaled coordinate 


aie (2.21) 


where the atomic number has been incorporated by the replacement 
e?/A4n€g — Ze?/4req (as in Chapter 1). There are some important prop- 
erties of the radial wavefunctions that require a general form of the 
solution and for future reference we state these results. The probability 
density of electrons with | = 0 at the origin is 


Into (0)? = — (2) (2.22) 


us nag 


For electrons with | 4 0 the expectation value of 1/r? is 


(3) = iP SR, ieee de — ae (2) . (2,23) 


These results have been written in a form that is easy to remember; 
they must both depend on 1/a% in order to have the correct dimensions 
and the dependence on Z follows from the scaling of the Schrédinger 


equation. The dependence on the principal quantum number n also 
seems to follow from eqn 2.21 but this is coincidental; a counterexample 


is? 
yo iS 
r/  n2\ao/- 
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(2.24) 


The wavefunction solutions of the Schrédinger equation for particular 
energies are standing waves and give a distribution of electronic charge 
~e|y(r)|? that is constant in time. We shall now consider how transi- 
tions between these stationary states occur when the atom interacts with 
electromagnetic radiation that produces an oscillating electric field!® 

E (t) = |Eo| Re (e7'* €;aa) (2.25) 
with constant amplitude |Eo| and polarization vector @,aq.2° If w lies 
close to the atomic resonance frequency then the perturbing electric 
field puts the atom into a superposition of different states and induces 
an oscillating electric dipole moment on the atom (see Exercise 2.10). 
The calculation of the stimulated transition rate requires time-dependent 
perturbation theory (TDPT), as described in Chapter 7. However, the 
treatment from first principles is lengthy and we shall anticipate some 
of the results so that we can see how spectra relate to the underlying 
structure of the atomic energy levels. This does not require an exact 
calculation of transition rates, but we only need to determine whether 
the transition rate has a finite value or whether it is zero (to first order), 
i.e. whether the transition is allowed and gives a strong spectral line, or 
is forbidden. 

The result of time-dependent perturbation theory is encapsulated in 
the golden rule (or Fermi’s golden rule);”’ this states that the rate of 
transitions is proportional to the square of the matrix element of the 
perturbation. The Hamiltonian that describes the time-dependent in- 
teraction with the field in eqn 2.25 is H’ = er- E(t), where the electric 
dipole operator is —er.?? This interaction with the radiation stimulates 
transitions from state 1 to state 2 at a rate”? 


2 
Rate « |eEo|” [vi (r+ @raa) Yi d?r| = |eEo|” x |(2|r- Gag |1)I? « 

(2.26) 

The concise expression in Dirac notation is convenient for later use. This 

treatment assumes that the amplitude of the electric field is uniform over 

the atom so that it can be taken outside the integral over the atomic 

wavefunctions, i.e. that Eg does not depend on r.74 We write the dipole 


matrix element as the product 
(2| Tr: rad {1) = Di Tang : (227) 


The radial integral is?° 
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18 This quantity is related to the quan- 
tum mechanical expectation value of 
the potential energy (p.e.); as in the 
Bohr model the total energy is H = 


(p.e.) /2. 


19-The interaction of atoms with the os- 
cillating magnetic field in such a wave is 
considerably weaker; see Appendix C. 


20-'The unit vector Gada gives the direc- 
tion of the oscillating electric field. For 
example, for the simple case of linear 
polarization along the x-axis €,,2q = Ex 
and the real part of e~i”* is cos(wt); 
therefore E(t) = |Eo| cos(wt) €z. 


21 See quantum mechanics texts such as 
Mandi (1992). 


22-This is analogous to the interaction 
of a classical dipole with an electric 
field. Atoms do not have a perma- 
nent dipole moment, but one is induced 
by the oscillating electric field. For a 
more rigorous derivation, see Woodgate 
(1980) or Loudon (2000). 


23-The maximum transition rate occurs 
when w, the frequency of the radiation, 
matches the transition frequency w}2, 
as discussed in Chapter 7. Note, how- 
ever, that we shall not discuss the so- 
called ‘density of states’ in the golden 
rule since this is not straightforward for 
monochromatic radiation. 


247 eqn 2.25 the phase of the wave is 
actually (wt —k-r), where r is the co- 
ordinate relative to the atom’s centre 
of mass (taken to be the origin) and k 
is the wavevector. We assume that the 
variation of phase k - r is small over the 
atom (kag < 27). This is equivalent to 
A > ao, i.e. the radiation has a wave- 
length much greater than the size of the 
atom. This is called the dipole approx- 
imation. 


25 Note that Dy2 = Doi. 
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?6Tf either the atoms have random ori- 
entations (e.g. because there is no ex- 
ternal field) or the radiation is unpo- 
larized (or both), then an average over 
all angles must be made at the end of 
the calculation. 


ir = / Rests (FER) r? dr. (2.28) 
0 


The angular integral is 


Qa WT 
Tang = [ [ Vit m, (0,4) F-Sraa Yi ym (8,0) sin@d0dd, (2.29) 


where Ff = r/r. The radial integral is not normally zero although it can 
be small for transitions between states whose radial wavefunctions have 
a small overlap, e.g. when nm, is small and ng is large (or the other way 
round). In contrast, the Zang = 0 unless strict criteria are satisfied— 
these are the selection rules. 


2.2.1 Selection rules 


The selection rules that govern allowed transitions arise from the angular 
integral in eqn 2.29 which contains the angular dependence of the inter- 
action r-€,,q for a given polarization of the radiation. The mathematics 
requires that we calculate Zang for an atom with a well-defined quanti- 
sation axis (invariably chosen to be the z-axis) and radiation that has a 
well-defined polarization and direction of propagation. This corresponds 
to the physical situation of an atom experiencing the Zeeman effect of an 
external magnetic field, as described in Section 1.8; that treatment of the 
electron as a classical oscillator showed that the components of differ- 
ent frequencies within the Zeeman pattern have different polarizations. 
We use the same nomenclature of 7- and o-transitions here; transverse 
observation refers to radiation emitted perpendicular to the magnetic 
field, and longitudinal observation is along the z-axis.?° 

To calculate Zang we write the unit vector r in the direction of the 
induced dipole as: 


<i Z Zs 
rt 7 (ez + yey + 2€:) 


= sin 6 cos @e, + sin@sin Pe, + cose, . (2.30) 
Expressing the functions of @ and ¢ in terms of spherical harmonic func- 
tions as 
Z 
sin @ cos ¢ = 4/ = (Yi-1 -—NYia), 
j : ede 
sin# sind =i a (Wier a), (2.31) 
An 
6=4/—Y; 
COs 3 1,05 
leads to 


V2 


y ~ 
1 pes ae 
’ J/2 1,0 z PL 


We write the general polarization vector as 


(2.32) 


e; a=, ie, 


V2 


Crad = AG = 


a é, +i€ 
= Ave oe (-*4 5°) (2.39) 


where A, depends on the component of the electric field along the z- 
axis and the component in the xy-plane is written as a superposition of 
two circular polarizations with amplitudes A,+ and A,- (rather than 
in terms of linear polarization in a Cartesian basis).2’ Similarly, the 
classical motion of the electron was written in terms of three eigenvectors 
in Section 1.8: an oscillation along the z-axis and circular motion in the 
xy-plane, both clockwise and anticlockwise. 

From the expression for r in terms of the angular functions Y; ;, (9, @) 
with / = 1 we find that the dipole induced on the atom is proportional 
to28 


T- Craa = As Yy 4 a AzYi,0 aR Ag+ ¥i41 : (2.34) 


The following sections consider the transitions that arise from these three 
ferms.7? 


w-transitions 


The component of the electric field along the z-axis A, induces a dipole 
moment on the atom proportional to @,4q -é€, = cos@ and the integral 
over the angular parts of the wavefunctions is 


27 ph 
Ting = i / ‘p,m (9,0) cosO Yi, m; (8, ¢) sind dd dd. (2.35) 


To determine this integral we exploit the symmetry with respect to ro- 
tations about the z-axis.°° The system has cylindrical symmetry, so the 
value of this integral is unchanged by a rotation about the z-axis through 
an angle ¢o: 

TS ese a So 


(2.36) 
This equation is satisfied if either ZZ,, = 0 or mj, = mi. For this 


polarization the magnetic quantum number does not change, Am; = 0.3! 


o-transitions 


The component of the oscillating electric field in the zy-plane excites o- 
transitions. Equation 2.34 shows that the circularly-polarized radiation 
with amplitude A,+ excites an oscillating dipole moment on the atom 
proportional Y,,; « sin6e'®, for which the angular integral is 


2m pt 
I, = / i Vis ma (6,9) sind e® Y;, m, (0,¢) sin@d0d¢. (2.37) 


Again, consideration of symmetry with respect, to rotation about the z- 
axis through an arbitrary angle shows that Zz, = 0 unless m;, — mj, + 
1=0. The interaction of an atom with circularly-polarized radiation of 
the opposite handedness leads to a similar integral but with e!? > e~'?; 
this integral La = 0 unless m;, — mi, — 1 = 0. Thus the selection rule 
for the o-transitions is Am; = +1. 

We have found the selection rules that govern Am, for each of the 


three possible polarizations of the radiation separately. These apply 
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27 We will see that the labels 7, o+ and 
go” refer to the transition that the radi- 
ation excites; for this it is only impor- 
tant to know how the electric field be- 
haves at the position of the atom. The 
polarization state associated with this 
electric field, e.g. whether it is right- 
or left-handed circularly-polarized radi- 
ation, also depends on the direction of 
propagation (wavevector), but we shall 
try to avoid a detailed treatment of the 
polarization conventions in this discus- 
sion of the principles. Clearly, however, 
it is important to have the correct po- 
larization when setting up actual exper- 
iments. 


28-The eigenvectors have the following 
properties: 


@, +ié, €,—iéy 
a Je a = 
and 
@, + i€y er + ie, = 
v2 v2 
297n spherical tensor notation 


(Woodgate 1980) the three vector 
components are written A_;, Ag and 
Aji, which is convenient for more 
general use; but writing eqn 2.34 as 
given emphasises that the amplitudes 
A represent the different polarizations 
of the radiation and the spherical 
harmonics come from the atomic 
response (induced dipole moment). 


30 A iternative methods are given below 
and in Exercise 2.9. 


31 We use my, to distinguish this quan- 
tum number from ms, the magnetic 
quantum number for spin angular mo- 
mentum that is introduced later. Spe- 
cific functions of the spatial variables 
such as Yj, and e~'? do not need 
this additional subscript. 
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32.9imilar behaviour arises in the clas- 
sical model of the normal Zeeman ef- 
fect in Section 1.8, but the quantum 
treatment in this section shows that 
it is a general feature of longitudinal 
observation—not just for the normal 
Zeeman effect. 


33See the references on angular mo- 
mentum in quantum mechanics; the 
reason why the magnetic quantum 
numbers add is obvious from ®(¢). 


34We have fo7 fi" Vir m?Yism sin 6.d0 do 
= 61',15m!',m- This reduces to the nor- 
malisation in Table 2.1 when l’ = 1 and 
m' =m. 


35 This argument applies only for elec- 
tric dipole radiation. Higher-order 
terms, e.g. quadrupole radiation, can 
give Al > 1. 


when the polarized light interacts with an atom that has a well-defined 
orientation, e.g. an atom in an external magnetic field. If the light 
is unpolarized or there is no defined quantisation axis, or both, then 
Angel. 


Example 2.1 Longitudinal observation 

Electromagnetic radiation is a transverse wave with its oscillating elec- 
tric field perpendicular to the direction of propagation, €raqa -k = 0. 
Thus radiation with wavevector k = ke, has A, = 0 and z-transitions 
do not occur.®? Circularly-polarized radiation (propagating along the z- 
axis) is a special case for which transitions occur with either Am; = +1 
or Am; = —1, depending on the handedness of the radiation, but not 
both. 


2.2.2 Integration with respect to 0 


In the angular integral the spherical harmonic functions with | = 1 (from 
eqn 2.34) are sandwiched between the angular momentum wavefunctions 
of the initial and final states so that 


2a rt 
Teens i / V5 ma Yism Yim, Sin 0 dO dd. (2.38) 
0 0 


To calculate this angular integral we use the following formula:** 


Veter Caan = AY Al meee ate 1S Sea r) (2.39) 


where A and B are constants whose exact values need not concern us. 
Thus from the orthogonality of the spherical harmonics** we find 
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The delta functions give the selection rule found previously, namely 
Am; = m, where m = 0,+1 depending on the polarization, and also 
Al = +1. In the mathematics the functions with | = 1 that represent 
the interaction with the radiation are sandwiched between the orbital 
angular momentum eigenfunctions of the initial and final states. Thus 
the rule AJ = +1 can be interpreted as conservation of angular momen- 
tum for a photon carrying one unit of angular momentum, h (Fig. 2.8 
illustrates this reasoning for the case of total angular momentum).°° The 
changes in the magnetic quantum number are also consistent with this 
picture—the component of the photon’s angular momentum along the 
z-axis being Am; = 0, +1. Conservation of angular momentum does not 
explain why Al 4 0—this comes about because of parity, as explained 
below. 


2.2.0 Larity 


Parity is an important symmetry property throughout atomic and molec- 
ular physics and its general use will be explained before applying it to 


selection rules. The parity transformation is an inversion through the 
origin given by r > —r. This is equivalent to the following transforma- 
tion of the polar coordinates: 


?0—+n-—6: areflection, 
¢g—gd+7: arotation. 


The reflection produces a mirror image of the original system and parity 
is also referred to as mirror symmetry. The mirror image of a hydrogen 
atom has the same energy levels as those in the original atom since the 
Coulomb potential is the same after reflection. It turns out that all the 
electric and magnetic interactions ‘look the same’ after reflection and 
all atoms have parity symmetry.°® To find the eigenvalues for parity we 
use the full quantum mechanical notation, with hats to distinguish the 
operator P from its eigenvalue P in the equation 


Py=Pyr, (2.40) 


from which it follows that P’y = P* w. Two successive parity operations 
correspond to there being no change (the identity operator), ie. r > 
—r +r. Thus P? = 1. Therefore the parity operator has eigenvalues 
P = 1 and —1 that correspond to even and odd parity wavefunctions, 
respectively: 


Puo=b or PpH=-v. 


Both eigenvalues occur for the spherical harmonic functions, 
ei an ire (2.41) 


The value of the angular integral does not change in a parity trans- 


formation?” so 
1 ao anise an (2.42) 


Thus the integral is zero unless the initial and final states have opposite 
parity (see Exercise 2.12). In particular, electric dipole transitions re- 
quire an odd change in the orbital angular momentum quantum number 
(Al Ao), 

The treatment above of the parity operator acting on a wavefunction 
is quite general and even in complex atoms the wavefunctions have a 
definite parity. The selection rules we have discussed in this section and 
others are tabulated in Appendix C. If the electric dipole matrix element 
is zero between two states then other types of transition may occur but 
at a rate many orders of magnitude slower than allowed transitions. 

The allowed transitions between the n = 1, 2 and 3 shells of atomic 
hydrogen are shown in Fig. 2.2, as an example of the selection rules. 
The 2s configuration has no allowed transitions downwards; this makes 
it metastable, i.e. it has a very long lifetime of about 0.125s.°? 

Finally, a comment on the spectroscopic notation. It can be seen 
in Fig. 2.2 that the allowed transitions give rise to several series of 
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36This can be proved formally in 
quantum mechanics by showing that 
the Hamiltonians for these interactions 
commute with the parity operator. The 
weak interaction in nuclear physics does 
not have mirror symmetry and violates 
parity conservation. The extremely 
small effect of the weak interaction on 
atoms has been measured in exceed- 
ingly careful and precise experiments. 


37 See, for example, Mandl (1992). 


38-The radial integral is not changed by 
the parity transformation. 


39 This special feature is used in the ex- 
periment described in Section 2.3.4. 
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Fig. 2.2 Allowed transitions between 
the configurations of hydrogen obey the 
selection rule Al = +1. The configu- 
Tamonsawith ¢ = 1,2,3,4,5,... are la- 
belled s, p, d, f, g, and so on alphabeti- 
cally (the usual convention). In the spe- 
cial case of hydrogen the energy does 
not depend on the quantum number I. 


40For hydrogen this is the Lyman se- 
ries, as marked on Fig. 1.1; however, 
p-series is a general name. 


41 These names reflect the appearance 
of the lines in the first experimental ob- 
servations. 


a By, considering elliptical orbits, 
rather than just circular ones, Som- 
merfeld refined Bohr’s theory to 
obtain a relativistic expression for the 
energy levels in hydrogen that gave 
very accurate predictions of the fine 
structure; however, details of that 
approach are not given here. 


BOS 3 
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lines. The series of lines to the ground configuration is called the p- 
series, where p stands for principal—this is the only series observed in 
absorption*®—hence p labels configurations with 1 = 1. The s-series of 
lines goes from / = 0 configurations (to a level with | = 1), and similarly 
the d-series goes from / = 2 configurations; s and d stand for sharp and 
diffuse, respectively.*! 


2.3. Fine structure 


Relativistic effects lead to small splittings of the atomic energy levels 
called fine structure. We estimated the size of this structure in Section 
1.4 by comparing the speed of electrons in classical orbits with the speed 
of light.4? In this section we look at how to calculate fine structure 
by treating relativistic effects as a perturbation to the solutions of the 
Schrodinger equation. This approach requires the concept that electrons 
have spin. 


2.3.1 Spin of the electron 


In addition to the evidence provided by observations of the fine structure 
itself, that is described in this section, two other experiments showed 
that the electron has spin angular momentum, not just orbital angular 
momentum. One of these pieces of experimental evidence for spin was 
the observation of the so-called anomalous Zeeman effect. For many 
atoms, e.g. hydrogen and sodium, the splitting of their spectral lines 
in a magnetic field does not have the pattern predicted by the normal 
Zeeman effect (that we found classically in Section 1.8). This anomalous 
Zeeman effect has a straightforward explanation in terms of electron 
spin (as shown in Section 5.5). The second experiment was the famous 
Stern—Gerlach experiment that will be described in Section 6.4.1.4% 
Unlike orbital angular momentum, spin does not have eigenstates that 
are functions of the angular coordinates. Spin is a more abstract con- 
cept and it is convenient to write its eigenstates in Dirac’s ket notation 
as |sm,;). The full wavefunction for a one-electron atom is the product of 


the radial, angular and spin wavefunctions: ¥ = Ry, 1(r) Yim (0,¢) |sms). 


Or, using ket notation for all of the angular momentum, not just the spin, 


W = R,,(r) |lmpsm,). (2.43) 


These atomic wavefunctions provide a basis in which to calculate the 
effect of perturbations on the atom. However, some problems do not 
require the full machinery of (degenerate) perturbation theory and for 
the time being we shall treat the orbital and spin angular momenta by 
analogy with classical vectors. To a large extent this vector model is 
intuitively obvious and we start to use it without formal derivations. 
But note the following points. An often-used shorthand for the spin 
eigenfunctions is spin-up: 


eri, =.) =f), (2.44) 


and similarly ||) for the m, = —3 state (spin-down). However, in quan- 
tum mechanics the angular momentum cannot be completely aligned 
‘up’ or ‘down’ with respect to the z-axis, otherwise the x- and y-comp- 
onents would be zero and we would know all three components simul- 
taneously.44 The vector model mimics this feature with classical vectors 
drawn with length |s| = ,/s(s +1) = V3/2. (Only the expectation 
value of the square of the angular momentum has meaning in quan- 
tum mechanics.) The spin-up and spin-down states are as illustrated 
in Fig. 2.3 with components along the z-axis of +3. We can think of 
the vector as rotating around the z-axis, or just having an undefined 
direction in the xy-plane corresponding to a lack of knowledge of the z- 
and y-components (see also Grant and Phillips 2001). 

The name ‘spin’ invokes an analogy with a classical system spinning on 
its axis, e.g. a sphere rotating about an axis through its centre of mass, 
but this mental picture has to be treated with caution; spin cannot be 
equal to the sum of the orbital angular momenta of the constituents since 
that will always be an integer multiple of h. In any case, the electron is 
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43-The fine structure, anomalous Zee- 
man effect and Stern—Gerlach exper- 
iment all involve the interaction of 
the electron’s magnetic moment with a 
magnetic field—the internal field of the 
atom in the case of fine structure. Stern 
and Gerlach detected the magnetic in- 
teraction by its influence on the atom’s 
motion, whereas the Zeeman effect and 
fine structure are observed by spectro- 


scopy. 


44-This is not possible since the oper- 
ators for the z-, y- and z-components 
of angular momentum do not commute 
(save in a few special cases; we can 
know that sz = sy = sz = 0 if s=0). 
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Fig. 2.3 The representation of (a) spin- 
up and (b) spin-down states as vectors 
precessing around the z-axis. 


45-The Biot-Savart law for the magnetic 
field from a current flowing along a 
straight wire can be recovered from the 
Lorentz transformation and Coulomb’s 
law (Griffiths 1999). However, this 
link can only be made in this direction 
for simple cases and generally the phe- 
nomenon of magnetism cannot be ‘de- 
rived’ in this way. 


(b) 


z 


ig 


a structureless elementary particle with no measurable size. So we are 
left with the experimental fact that the electron has an intrinsic spin 
angular momentum of h/2 and these half-integer values are perfectly 
acceptable within the general theory of angular momentum in quantum 
mechanics. 


2.3.2 The spin—orbit interaction 


The Schrédinger equation is non-relativistic, as can readily be seen by 
looking at the kinetic-energy operator that is equivalent to the non- 
relativistic expression p?/2m,e. Some of the relativistic effects can be 
taken into account as follows. An electron moving through an electric 
field E experiences an effective magnetic field B given by 


1 
Ba =y * E (2.45) 


This is a consequence of the way an electric field behaves under a Lorentz 
transformation from a stationary to a moving frame in special relativity. 
Although a derivation of this equation is not given here, it is certainly 
plausible since special relativity and electromagnetism are intimately 
linked through the speed of light c = 1/,/€oft9. This equation for the 
speed of electromagnetic waves in a vacuum comes from Maxwell’s equa- 
tions; €9 being associated with the electric field and pip with the magnetic 
field. Rearrangement to give tio = 1/ ( €9c”) suggests that magnetic 
fields arise from electrodynamics and relativity.*° 

We now manipulate eqn 2.45 into a convenient form, by substituting 
for the electric field in terms of the gradient of the potential energy V 
and unit vector in the radial direction: 


Gel arr 
E == -. 
eorr 


(2.46) 


The factor of e comes in because the electron’s potential energy V equals 
its charge —e times the electrostatic potential. From eqn 2.45 we have 


1 Mera MA h 
Be ( Be) Px me = 5 (LE Ni, (2.47) 
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where the orbital angular momentum is hl = r x mev. The electron 
has an intrinsic magnetic moment yz = —g,j/ps, where the spin has a 
magnitude of |s| = s = 1/2 (in units of h) and g, ~ 2, so the moment 


has a magnitude close to one Bohr magneton (ug = eh/2m.). The 
interaction of the electron’s magnetic moment with the orbital field gives 


the Hamiltonian 
1a), 
er Or 


However, this expression gives energy splittings about twice as large 
as observed. The discrepancy comes from the Thomas precession—a 
relativistic effect that arises because we are calculating the magnetic 
field in a frame of reference that is not stationary but rotates as the 
electron moves about the nucleus. The effect is taken into account by 
replacing g, with g,—1 ~ 1.** Finally, we find the spin-orbit interaction, 
including the Thomas precession factor, is*” 


(2.48) 


ie 10V 
|: fee —1 -— male 2.4 
sO (9s Smee €E Or ) s ( 9) 
For the Coulomb potential in hydrogen we have 
10V__ e?/4rre€o 
-— = ——. Des 
POF 7 (2.50) 
The expectation value of this Hamiltonian gives an energy change of*® 


Ree 2? fl 

Es—o a m2? Ameo (7) (6-0). 
The separation into a product of radial and angular expectation values 
follows from the separability of the wavefunction. The integral (1/r*) is 
given in eqn 2.23. However, we have not yet discussed how to deal with 
interactions that have the form of dot products of two angular momenta; 
let us start by defining the total angular momentum of the atom as the 
sum of its orbital and spin angular momenta, 


j=ul+s. (2.52) 


This is a conserved quantity for a system without any external torque 
acting on it, e.g. an atom in a field-free region of space. This is true 
both in classical and quantum mechanics, but we concentrate on the 
classical explanation in this section. The spin-orbit interaction between 
l and s causes these vectors to change direction, and because their sum 
is constrained to be equal to j they move around as shown in Fig. 2.4.*% 
Squaring and rearranging eqn 2.52, we find that 2s-1 = j? — P —s*. 
Hence we can find the expectation value in terms of the known values 
for (j?), (I?) and (s?) as 


1 
(ga eet) — als 1)) (2.53) 
Thus the spin-orbit interaction produces a shift in energy of 
E, Be ee aes 1). (2.54) 


= 5 
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46-This is almost equivalent to using 
gs/2 ~ 1, but gs — 1 is more accu- 
rate at the level of precision where the 
small deviation of gs from 2 is impor- 
tant (Haar and Curtis 1987). For fur- 
ther discussion of Thomas precession 
see Cowan (1981), Eisberg and Resnick 
(1985) and Munoz (2001). 


47 We have derived this classically, e.g. 
by using Al = r X mev. However, the 
same expression can be obtained from 
the fully relativistic Dirac equation for 
an electron in a Coulomb potential by 
making a low-velocity approximation, 
see Sakurai (1967). That quantum me- 
chanical approach justifies treating 1 
and s as operators. 


48 Using the approximation g,; —1~ 1. 


Fig. 2.4 The orbital and spin angular 
momenta add to give a total atomic an- 
gular momentum of j. 


491n this precession about j the magni- 
tudes of 1 and s remain constant. The 
magnetic moment (proportional to s) 
is not altered in an interaction with 
a magnetic field, and because of the 
symmetrical form of the interaction in 
eqn 2.49, we do not expect 1 to be- 
have any differently. See also Blundell 
(2001) and Section 5.1. 
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50 As shown in Section 1.9, Meacap = 
hand hcReo = (e?/47€0)/(2a0). 


51 Another short form found in the lit- 
erature is 2?Py/. and 2?P3/p. 


where the spin-orbit constant 6 is (from-eqns 2.51 and 2.23) 


ie Vee 1 (2.55) 
b= G20 Ge ree | 
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A single electron has s = , so, for each J, its total angular momentum 


quantum number j has two possible values: 


it ih 
g=t- 5 oF [— 5° 
From eqn 2.54 we find that the energy interval between these levels, 
AE,—9 = j=l+i aes is 
a? hcRso 


AE,-o = B (i+ 4) = ea 1): (2.56) 


Or, expressed in terms of the gross energy E(n) in eqn 1.10,°° 


a2 


This agrees with the qualitative discussion in Section 1.4, where we 
showed that relativistic effects cause energy changes of order a” times 
the gross structure. The more complete expression above shows that the 
energy intervals between levels decrease as n and I increase. The largest 
interval in hydrogen occurs for n = 2 and / = 1; for this configuration 
the spin-orbit interaction leads to levels with 7 = 1/2 and 7 = 3/2. 
The full designation of these levels is 2p?P 1/2 and 2p?P3/2, in the no- 
tation that will be introduced for the LS-coupling scheme. But some of 
the quantum numbers (defined in Chapter 5) are superfluous for atoms 
with a single valence electron and a convenient short form is to denote 
these two levels by 2 Pj /2 and 2P3/2; these correspond to nP;, where 
P represents the (total) orbital angular momentum for this case. (The 
capital letters are consistent with later usage.) Similarly, we may write 
2S1/2 for the 2s7S1/2 level; 3D3/2 and 3D5/2 for the j = 3/2 and 5/2 
levels, respectively, that arise from the 3d configuration.°! But the full 
notation must be used whenever ambiguity might arise. 


AEs-o = 


2.3.3 The fine structure of hydrogen 


As an example of fine structure, we look in detail at the levels that arise 
from the n = 2 and n = 3 shells of hydrogen. Equation 2.54 predicts 
that, for the 2p configuration, the fine-structure levels have energies of 


Es-o (2 Py /2) = —Prp, 
E,—o (2 P3/2) = + Bop 


as shown in Fig. 2.5(a). For the 3d configuration 


3 
Es—o (3 D3/2) = — Psa» 
Es—o (3 Ds/2) = Bsa, 


as shown in Fig. 2.5(b). For both configurations, it is easy to see that 
the spin-orbit interaction does not shift the mean energy 


E = (23 + 1) Bj(n,0) + (27+ 1) Ej (n,)), (2.58) 
where j’ = /1—1/2 and j = 1+ 1/2 for the two levels. This calculation of 
the ‘centre of gravity’ for all the states takes into account the degeneracy 
of each level. 

The spin-orbit interaction does not affect the 25)/2 or 35;/2 so we 
might expect these levels to lie close to the centre of gravity of the 
configurations with | > 0. This is not the case. Fig. 2.6 shows the 
energies of the levels for the n = 3 shell given by a fully relativistic 
calculation. We can see that there are other effects of similar magnitude 
to the spin-orbit interaction that affect these levels in hydrogen. Quite 
remarkably, these additional relativistic effects shift the levels by just the 
right amount to make n P;/2 levels degenerate with the n Sj /2 levels, and 
n P3/2 degenerate with n D3/2. This structure does not occur by chance, 
but points to a deeper underlying cause. The full explanation of these 
observations requires relativistic quantum mechanics and the technical 
details of such calculations lie beyond the scope of this book.®? We shall 
simply quote the solution of the Dirac equation for an electron in a 
Coulomb potential; this gives a formula for the energy Epirac (n,j) that 
depends only on n and j, i.e. it gives the same energy for levels of the 
same n and j but different J, as in the cases above. In a comparison of the 
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(a) j= eo 
No spin-orbit Area 
interaction 

jaye 
(b) j= 5/2 
No spin-orbit AF, 
interaction 3 Bsa 
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Fig. 2.5 The fine structure of hydro- 
gen. The fine structure of (a) the 2p 
and (b) the 3d configurations are drawn 
on different scales: 2p is consider- 
ably greater than B3g. All p- and d- 
configurations look similar apart from 
an overall scaling factor. 


52 See graduate-level quantum mechan- 
ics texts, e.g. Sakurai (1967) and Series 
(1988). 


Relativistic 
mass 


Fig. 2.6 The theoretical positions of the energy levels of hydrogen calculated by the fully relativistic theory of Dirac depend 
on n and j only (not J), as shown in this figure for the n = 3 shell. In addition to the spin-orbit interaction, the effects that 
determine the energies of these levels are: the relativistic mass correction and, for s-electrons only, the Darwin term (that 
accounts for relativistic effects that occur at small r, where the electron’s momentum becomes comparable to mec). 
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53-The Dirac equation predicts that the 
electron has gs = 2 exactly. 


547 amb and Retherford used a radio- 
frequency to drive the 25; /9-2Pi/2 
transition directly. This small en- 
ergy interval, now know as the Lamb 
shift, cannot be resolved in conven- 
tional spectroscopy because of Doppler 
broadening, but it can be seen us- 
ing Doppler-free methods as shown in 
ses, tei fo 


55The QED calculation of the Lamb 
shift is described in Sakurai (1967). 


°6 Broadly speaking, in a mathemat- 
ical treatment these vacuum fluctua- 
tions correspond to the zero-point en- 
ergy of quantum harmonic oscillators, 
i.e. the lowest energy of the modes of 
the system is not zero but hw/2. 


57OED also explains why the g-factor 
of the electron is not exactly 2. Pre- 
cise measurements show that g; = 
2.002 319 3043718 (current values for 
the fundamental constants can be 
found on the NIST web site, and those 
of other national standards laborato- 
ries). See also Chapter 12. 


exact relativistic solution of the Dirac equation and the non-relativistic 
energy levels, three relativistic effects can be distinguished. 


(a) There is a straightforward relativistic shift of the energy (or equiva- 
lently mass), related to the binomial expansion of y = (1—v?/e?)~/2, 
in eqn 1.16. The term of order v?/c? gives the non-relativistic Ie 
netic energy p?/2me. The next term in the expansion is propor- 
tional to v*/c* and gives an energy shift of order v?/c? times the 
gross structure—this is the effect that we estimated in Section 1.4. 


For electrons with | 4 0, the comparison of the Dirac and Schrodinger 
equations shows that there is a spin-orbit interaction of the form 
given above, with the Thomas precession factor naturally included.*? 


(b) 


For electrons with 1 = 0 there is a Darwin term proportional to 
|) (r = 0)|” that has no classical analogue (see Woodgate (1980) for 
further details). 


(c) 


That these different contributions conspire together to perturb the 
wavefunctions such that levels of the same n and 7 are degenerate seems 
improbable from a non-relativistic point of view. It is worth reiterat- 
ing the statement above that this structure arises from the relativistic 
Dirac equation; making an approximation for small v?/c* shows that 
these three corrections, and no others, need to be applied to the (non- 
relativistic) energies found from the Schrédinger equation. 


2.3.4 The Lamb shift 


Figure 2.7 shows the actual energy levels of the n = 2 and n = 3 shells. 


According to. relativistic quantum 1 theory t the 1¢ 25/2 level should be ex- 


RPE cane enorme 


actly degenerate with 2 Pj /» because they both have n= 2 and ae. 


but_in reality there is an_energy interval between them, _E (2 Si/2) — 
E(2Pij2) ~ 1GHz. The shift of the 2Sj/ level to a higher energy 


“(lower binding energy) than the Epjrac(n = 2,j = 1/2) is about one- 
tenth of the interval b between 1 ‘the two fine- structure levels, E(2P; 72) — 


Baa Emi Py ys /2) = 11GHz. Although small, this discrepancy in hydrogen was 


_of great_historical importance in physics. For this simple one-electron 


atom the predictions of the Dirac_equation are very precise and that 
theory cannot account for Lamb and Retherford’s experimental mea- 


surement that the Pe level j is ‘indeed | higher. than the 2Pi42 level. ae 


The he explanation of this Lamb shift goes beyond relativistic quantum me- 


chanics and requires quantum _clectrodynamics (QED)—the quantum 
field theory that describes electromagnetic interactions. Indeed, the ob- 
servation of the Lamb shift experiment was a stimulus for the develop- 
ment of this theory.”? An intriguing feature of QED is so-called vacuum 


fluctuations — regions of free space are not regarded as being completely 


empty but _are permeated by fluctuating electromagnetic fields.>® The 
QED effects lead 1 to_a_significant. energy shift for Sectrons with | = 0 
and hence break the degeneracy of 2 Sj /2 and 2] 2 Pip? ’ The larg rgest QED 
shift occurs for the 15, /. ground level of h hydrogen | but. ‘there i is no other 


level nearby and so a determination of its energy requires a precise mea- 
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surement of a large frequency. Nowadays this can be achieved by laser 


spectroscopy NOH o Kes 8) but the near degeneracy of une two j = 1 V2 
levels with n An 


portant fe Stores in that experiment was the » metastability of the 2S [2 


level, whose se lifetime wa le Was given in Section wie 3. ‘That level decays ru Oil 


times more slowly “than that ¢ of 2P ij. 


gen (a at room temperature) the _atoms have typical velocities of about 
nn a 
oer and atoms excited into the 2p “configuration travel. al_aver- 


oon a radiation. In x metastable aps travel the full length 
of. the apparatus (~ 1m) and are de-excited when they collide with a 


detector (or the wall of the vacuum chamber). Hydrogen, and hydro- 
genic systems, are still used for experimental tests of fundamental theory 


because_their simplicity allows very precise predictions. 


2.3.5 Transitions between fine-structure levels 


Transitions in hydrogen between the fine-structure levels with principal 
quantum numbers n = 2 and 3 give the components of the Balmer-a 
line shown in Fig. 2.7; in order of increasing energy, the seven allowed 


In an atomic beam of hydro- 
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Hig? 2.7 The fing structure of the 


ne and n = 3 shells of hi hydro- 


gen j 1 and the allowed transitions between 
the levels. According to the Dirac 


equation, the 2S,/2 and 2Pj/2 levels 
should be degenerate, but they are not. 
pelt: measured positions show that the 


2s?S, /2\ level is shifted upwards relative 
to the position Epirac(n = 2, 7 = 172), 
and_is therefore not degenerate_ with | 
the 2p “ 2P, [2 level. Such a shift occurs. 
for all the s- electrons (but_the size of 
the energy shift decreases with increas-_ 


ing 7). is st 


The explanation of this shift 
takes us beyond relativistic quantum 
mechanics into the realm of quantum 
electrodynamics (QED)—the quantum. 
field theory that describes electromag- 
netic interactions. 


Fig. 2.8 The conservation of total an- 
gular momentum in electric dipole tran- 
sitions that gives the selection rule in 
eqn 2.59 can be represented as vector 
addition. The photon has one unit of 
angular momentum, and so to go from 
level j1 to ja the vectors must form a 
triangle, as shown for the case of (a) 
vB = 1/2 ie) I) = iV, (b) A= W2 to 
42.—13/2,ande(G)) 91) 10/2 OM ayes 


°8 Higher shells have smaller shifts be- 
tween the j = 1/2 levels. 
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transitions between the levels with different 7 are as follows: 


2P3/2 —351/2; 
2P3/2—3D3;/2, 
2P3/2 —3Ds;2,; 
2Sij2—-3Pi;e2, 
2Pi/2—-3S8i;/2,; 
2S1/2 — 3P3/2, 
2Py oe. 


These obey the selection rule AJ = +1 but an additional rule prevents a 
transition between 3D5/2 and 2 Pj/2, namely that the change of the totat 


angular momentum _quantum number in an electric dipole transition 
obeys. 


Aj =0,+1. (2.59) 


This selection rule may be explained by angular momentum conserva- 
tion (as mentioned in in Section 2 OS a This rule can be expressed i in terms 
of vector. addition,. as.shown in Fig. yo 8; the cc conservation condition is 
equivalent to being able to form a triangle from the three vectors rep- 
resenting j of the initial state, the final state, anda unit.vector_for the 
(one unit of) angular momentum carried by the photon. Hence, this se- 
lection rule is sometimes referred to as the triangle rule. The projection 
of j along the z-axis can change by Am; = 0,+1. (Appendix C gives a 
summary of all selection rules.) 


Further reading 


Much of the material covered in this chapter can be found in the intro- 
ductory quantum mechanics and atomic physics texts listed in the Ref- 
erences. For particular topics the following are useful: Segré (1980) gives 
an overview of the historical development, and Series (1988) reviews the 
work on hydrogen, including the important Lamb shift experiment. 


| — | 
Exercises 
(2.1) Angular-momentum eigenfunctions (2.2) Angular-momentum eigenfunctions 

(a) Verify that all the eigenfunctions with | = 1 (a) Find the eigenfunction with orbital angular 


are orthogonal to Yo,o 


momentum quantum number / and magnetic 


(b) Verify that all the eigenfunctions with | = 1 quantum number m = 1 — 1. 
are orthogonal to those with | = 2. (b) Verify that Y;,;-1 is orthogonal to Yj—-1,:-1. 


(2.3) 


(2.4) 


(2.5) 


(2.7) 


(2.8 


ice 


(2.9) 


(2.10) 


Radial wavefunctions 

Verify eqn 2.23 for n = 2,1 = 1 by calculating the 
radial integral (for Z = 1). 

Hydrogen 

For a hydrogen atom the normalised wavefunction 
of an electron in the 1s state, assuming a point 


nucleus, is 
1 \1/2 ’ 
4 = a spat ALE by 
wey=(a) ere, 


where ao is the Bohr radius. Find an approximate 
expression for the probability of finding the elec- 
tron in a small sphere of radius ry < ao centred on 
the proton. What is the electronic charge density 
in this region? 

Hydrogen 

The Balmer-a spectral line is observed from a 
(weak) discharge in a lamp that contains a mixture 
of hydrogen and deuterium at room temperature. 
Comment on the feasibility of carrying out an ex- 
periment using a Fabry—Perot étalon to resolve (a) 
the isotope shift, (b) the fine structure and (c) the 
Lamb shift. 


Transitions 

Estimate the lifetime of the excited state in a two- 
level atom when the transition wavelength is (a) 
100 nm and (b) 1000 nm. In what spectral regions 
do these wavelengths lie? 


Selection rules 

By explicit calculation of integrals over 6, for the 
case of 7-polarization only, verify that p to d tran- 
sitions are allowed, but not s to d. 


Spin—orbit interaction 

The spin-orbit interaction splits a single-electron 
configuration into two levels with total angular 
momentum quantum numbers j = / + 1/2 and 
j’ =1—1/2. Show that this interaction does not 
shift the mean energy (centre of gravity) of all the 
states given by (27 + 1) Ej + (27’ +1) Ej. 
Selection rule for the magnetic quantum number 
Show that the angular integrals for o-transitions 
contain the factor 


2n 
/ gil, —™1p£1)¢ dd. 
0 


Hence derive the selection rule Am; = +1 for this 
polarization. Similarly, derive the selection rule 
for the m-transitions. 


Transitions 
An atom in a superposition of two states has the 


Gr) 
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wavefunction 
W(t) = Ay (r) 12 1*/" 4 Bobo (r) 7 E2*/" 
The distribution of electronic charge is given by 
—e|W (t)|? = —e{ |Ayi|? + |By2|? 
+ |2A* By ol cos (wiat — ¢) }. 


Part of this oscillates at the (angular) frequency 

of the transition wi2 = we — Ww; = (EH2 — F,) /h. 

(a) A hydrogen atom is in a superposition of the 
1s ground state, #1 = R1,0 (r) Yo,0 (6, ¢), and 
the m; = 0 state of the 2p configuration, wo = 
R21 (r) ¥i,o (0,¢); A ~ 0.995 and B = 0.1 
(so the term containing B? can be ignored). 
Sketch the form of the charge distribution for 
one cycle of oscillation. 

(b) The atom in a superposition state may have 

an oscillating electric dipole moment 


—eD (t) = —e(W" (t) rWV (t)). 


What are the conditions on y and we for 
which D (t) #0. 


Show that an atom in a superposition of the 
same states as in part (a) has a dipole moment 
of 


—eD (t) = —e |2A* B| Zang 
® { f rRealr)Rio(r)r ar} cos(wi2t)ez , 


where Zang is an integral of the angular parts 
of the wavefunction with respect to @ and @. 
Calculate the amplitude of this dipole, in units 
Ob ean, 106 A — 9 11/472. 


A hydrogen atom is in a superposition of the 
1s ground state and the m; = 1 state of the 2p 
configuration, 2 = Re (r) Yi,1 (@,¢). Sketch 
the form of the charge distribution at various 
points in its cycle of oscillation. 


(c) 


(d 


= 


(e) Comment on the relationship between the 
time dependence of the charge distributions 
sketched in this exercise and the motion of the 
electron in the classical model of the Zeeman 


effect (Section 1.8). 


Angular eigenfunctions 

We shall find the angular momentum eigenfunc- 
tions using ladder operators, by assuming that for 
some value of / there is a maximum value of the 
magnetic quantum number mMmax. For this case 
Yinmax « O(O)e"™=*? and the function 0(6) can 
be found from 


1, @(8) exp (immax¢) = 0. 
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(a) Show that ©O(6) satisfies the equation Hence, or otherwise, prove that Zang is zero unless 
1 00(6) bose the initial and final states have opposite parity. 


@6@) 006. ™*sind’ 


(2.13) Selection rules in hydrogen 
Hydrogen atoms are excited (by a pulse of laser 


(b) Find the solution of the equation for ©(6). light that drives a multi-photon process) to a spe- 

(Both sides have the form f'(6)/f(@) whose cific configuration and the subsequent spontaneous 

integral is In{f(6)}.) By substituting emission is resolved using a spectrograph. Infra- 

this solution into eqn 2.5 to show that red and visible spectral lines are detected only at 

b = Mmax(™Mmax + 1), or otherwise, obtain the wavelengths 4.05 um, 1.87pm and 0.656 pm. 

eqn 2.10. Explain these observations and give the values of 

(2.12) Parity and selection rules nm and I for the configurations involved in these 


Show that eqn 2.42 implies that lz — 1, is odd. transitions. 


Helium 


Helium has only two electrons but this simplicity is deceptive. To treat 
systems with two particles requires new concepts that also apply to 
multi-particle systems in many branches of physics, and it is very worth- 
while to study them carefully using the example of helium. There is 
truth in the saying that atomic physicists count ‘one, two, many’ and a 
detailed understanding of the two-electron system is sufficient for much 
of the atomic structure in this book.! 


3.1 The ground state of helium 


Two electrons in the Coulomb potential of a charge Ze, e.g. the nucleus 
of an atom, obey a Schrédinger equation of the form 


_ #2 _ 72 2 
{vis eui+ £ (-2-242)hy= ne. (3.1) 


Gy r2 T12 


Here rj2 = |r; — r2| is the distance between electron 1 and electron 2 and 
the electrostatic repulsion of electrons is proportional to 1/rj2. Neglect- 
ing this mutual repulsion for the time being, we can write the equation 
as 


(Hy + Hz) yp = EY, (3.2) 

where f ; 
= — Vi - BS 
mA 2m Na ATEQr Cee 


and Hz is a similar expression for electron 2. Writing the atomic 
wavefunction as a product of the wavefunctions for each electron, y = 
w (1)%(2), allows us to separate eqn 3.2 into two single-electron 
Schrodinger equations: 

Fy (1) = Ex (1) (3.4) 
and a similar equation for y~ (2) with energy 2. The solutions of these 
one-electron equations are hydrogenic wavefunctions with energies given 
by the Rydberg formula. Helium has Z = 2 and in its ground state both 
electrons have energy E, = Ey = —4hcR~ = —54.4eV. Thus the total 
energy of the atom (neglecting repulsion) is 


E©) = FE, + Ey = -109eV. (3.5) 
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3.3 Evaluation of the integrals 


in helium 53 
Further reading 56 
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1 This book considers only those multi- 
electron systems with one, or two, va- 
lence electrons ‘outside’ a spherically- 
symmetric core of charge. 
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24 //4n is the angular part of an s- 
electron wavefunction. 


3This is a standard quantum mechan- 
ical technique whose mathematical de- 
tails are given in quantum texts. The 
essential principle of this technique is 
to find an expression for the energy 
in terms of a parameter—an effective 
atomic number in the case of helium— 
and then minimise the energy with re- 
spect to this parameter, i.e. study the 
variation in the energy as a function of 
the chosen parameter. 


4This is accessible at http://www. 
oup.co.uk/best .textbooks/physics/ 
ephys/illustrations/ along with 
other illustrations of elementary 
quantum ideas. 


51t is often said that ‘one electron is 
in a Sspin-up state and the other is spin- 
down’; what this really means is defined 
in the discussion of spin for the excited 
states of helium. 


6 The spatial wavefunction u contains 
both radial and angular parts but the 
energy does not depend on the mag- 
netic quantum number, so we drop m as 
a subscript on u. The repulsion from a 
spherically-symmetric 1s wavefunction 
does not depend on the orientation of 
the other electron. To show this mathe- 
matically we could carry m through all 
the calculations and examine the result- 
ing angular integrals, but this is cum- 
bersome. 


Now we need to calculate the perturbation produced by the electron— 
electron repulsion. The system has the spatial wavefunction 


x = 1 
tie = Ri? (m1) Ris” (r2) x 7 (3.6) 


Ar 


where radial wavefunctions are defined in Table 2.2.2 The expectation 
value of the repulsion is (see Section 3.3) 


Pe (fe il 

i i Vr2 —W12 7} dri 73 dro = 34eV. (34) 
A4te9 Jo Jo T12 
Adding this to the (zeroth-order) estimate EZ) gives an energy of E(1s”) 
= —109+ 34 = —75eV. It takes an energy of 75eV to remove both 
electrons from a helium atom leaving a bare helium nucleus Hett— 
the second ionization energy. To go from Het to He** takes 54.4eV, 
so this estimate suggests that the first ionization energy (required to 
remove one electron from He to create Het) is IE(He) ~ 75 — 54 = 
21eV. But the expectation value in eqn 3.7 is not small compared to 
the binding energy and therefore the perturbation has a significant effect 
on the wavefunctions. The necessary adjustment of the wavefunctions 
can be accounted for by the variational method.? This method gives a 
value close to the measured ionization energy 24.6eV. Helium has the 
highest first ionization energy of all elements because of its closed n = 1 
shell. For a plot of the ionization energies of the elements see Grant and 
Phillips (2001, Chapter 11, Fig. 18).* 

According to the Pauli exclusion principle, two electrons cannot have 
the same set of quantum numbers. Therefore there must be some ad- 
ditional quantum number associated with the two 1s-electrons in the 
ground state of helium—this is their spin (introduced in Section 2.3.1). 
The observed filling-up of the atomic (sub-)shells in the periodic table 
implies that two spin states are associated with each set of spatial quan- 
tum numbers n, 1, m;.° However, electrostatic energies do not depend 
on spin and we can find the spatial wavefunctions separately from the 
problem of finding the spin eigenfunctions. 


3.2 Excited states of helium 


To find the energy of the excited states we use the same procedure as 
for the ground state-——at first we neglect the mutual repulsion term and 
separate eqn 3.1 into two one-electron equations that have solutions® 


uis(1) — Ris(r1) x aan 
Unt (2) = Rni(T2)Yijm (62, $2) 


for the configuration 1sni. The spatial part of the atomic wavefunction 
is the product 


Uspace = Uis(1) Uns (2) . (3.8) 


Another wavefunction has the same energy, namely 


Wspace = U1s(2)Uns(1) . (3.9) 


These two states are related by a permutation of the labels on the elec- 
trons, 1 «+ 2; the energy cannot depend on the labeling of identical 
particles so there is exchange degeneracy. To consider the effect of the 
repulsive term on this pair of wavefunctions with the same energy (de- 
generate states) we need degenerate perturbation theory. There are two 
approaches. The look-before-you-leap approach is first to form eigen- 
states of the perturbation from linear combinations of the initial states.’ 
In this new basis the determination of the eigenenergies of the states is 
simple. It is instructive, however, simply to press ahead and go through 
the algebra once.® 
We rewrite the Schrodinger equation (eqn 3.1) as 


(Ho + H')p = Ey, (3.10) 


where Hp = H; + Ho, and we consider the mutual repulsion of the 
electrons H’ = e?/47e€orj2 as a perturbation. We also rewrite eqn 3.2 as 


Hop = Ey, (3.11) 


where E(°) = EF, + E> is the unperturbed energy. Subtraction of eqn 
3.11 from eqn 3.10 gives the energy change produced by the perturbation, 
AE =E- E, as 

H'p=AEW. (3.12) 


A general expression for the wavefunction with energy E() is a linear 
combination of expressions 3.8 and 3.9, with arbitrary constants a and 
b, 

= aUis(1) Uni (2) + buis(2)Uni(1). (3.13) 


Substitution into eqn 3.12, multiplication by either uj,(1)u*,(2) or 
ui,(2)u*,(1), and then integration over the spatial coordinates for each 
electron (71,61,¢1 and rz, 02,2) gives two coupled equations that we 


(5) G)=22@) 


This is eqn 3.12 in matrix form. The direct integral is 


1 e? 9) 
— ee // juis(1)|? rib \tnz(2)} dr} drs 
== i / Pra(ra)Pnt(2) ays aed (3.15) 
ATE T12 
where pis(1) = —e|uis(1)|” is the charge density distribution for elec- 


tron 1, and similarly for pj )(2). This direct integral represents the 
Coulomb repulsion of these charge clouds (Fig. 3.1). The exchange inte- 
gral is 


il 


in Atreég 


‘l uj,(1)uz,(2) = uss(2)uni( dr? dr3 . (3.16) 
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This is guided by looking for eigen- 
states of symmetry operators that com- 
mute with the Hamiltonian for the in- 
teraction, as in Section 4.5. 


8In the light of this experience one can 
take the short cut in future. 
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Fig. 3.1 The direct integral in a lsns 
configuration of helium corresponds to 
the Coulomb repulsion between two 
spherically-symmetric charge clouds 
made up of shells of charge like those 
shown. 


1p 
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J 
ene ¥...Asnl 
Unperturbed 
configuration 


Fig. 3.2 The effect of the direct and 
exchange integrals on a lsnl config- 
uration in helium. The singlet and 
triplet terms have an energy separation 
of twice the exchange integral (2K). 


"lt is easy to check which wavefunc- 
tion corresponds to which eigenvalue by 
substitution into the original equation. 


10 4 nother example is the classical 
treatment of the normal Zeeman effect. 


Unlike the direct integral, this does not have a simple classical interpre- 
tation in terms of charge (or probability) distributions—the exchange 
integral depends on interference of the amplitudes. The spherical sym- 
metry of the 1s wavefunction makes the integrals straightforward to 
evaluate (Exercises 3.6 and 3.7). 

The eigenvalues AE in eqn 3.14 are found from 


je ‘Kees 
K J2 age 


The roots of this determinantal equation are AE = J+ K. The direct 
integral shifts both levels together but the exchange integral leads to an 
energy splitting of 2K (see Fig. 3.2). Substitution back into eqn 3.14 
gives the two eigenvectors in which b = a and b = —a. These correspond 
to symmetric (S) and antisymmetric (A) wavefunctions: 


0. (3.17) 


USvace = { ttrs(1)tinr(2) + this (2)unr(1) } 


Vosace = 5 { tine (1)ttnt(2) — 119(2)ttnt(1) }- 


The wavefunction w3\,.¢ has an eigenenergy of E\) + J — K, and this 
is lower than the energy E + J+ K for Wace. (For the snl con- 
figurations in helium K is positive.)® This is often interpreted as the 
electrons ‘avoiding’ each other, i.e. eat = 0 for rm) = ro, and for this 
wavefunction the probability of finding electron 1 close to electron 2 is 
small (see Exercise 3.3). This anticorrelation of the two electrons makes 
the expectation of the Coulomb repulsion between the electrons smaller 
than for ee: 

The occurrence of symmetric and antisymmetric wavefunctions has a 
classical analogue illustrated in Fig. 3.3. A system of two oscillators, 
with the same resonance frequency, that interact (e.g. they are joined 
together by a spring) has antisymmetric and symmetric normal modes 
as illustrated in Fig. 3.3(b) and (c). These modes and their frequencies 
are found in Appendix B as an example of the application of degenerate 
perturbation theory in Newtonian mechanics.'° 

The exchange integral decreases as n and | increase because of the 
reduced overlap between the wavefunctions of the excited electron and 


a. 


— 


the 1s-electron. These trends are an obvious consequence of the form of 
the wavefunctions: the excited electron’s average orbit radius increases 
with energy and hence with n; the variation with | arises because the 
effective potential from the angular momentum (‘centrifugal’ barrier) 
leads to the wavefunction of the excited electron being small at small 
r. However, in the treatment as described above, the direct integral 
does not tend to zero as n and / increase, as shown by the following 
physical argument. The excited electron ‘sees’ the nuclear charge of +2e 
surrounded by the Is electronic charge distribution, i.e. in the region far 
from the nucleus where nl-electron’s wavefunction has a significant value 
it experiences a Coulomb potential of charge +le. Thus the excited 
electron has an energy similar to that of an electron in the hydrogen 
atom, as shown in Fig. 3.4. But we have started with the assumption 
that both the 1s- and nl-electrons have an energy given by the Rydberg 
formula for Z = 2. The direct integral J equals the difference between 
these energies.1! This work was an early triumph for wave mechanics 
since previously it had not been possible to calculate the structure of 
helium.!? 

In this section we found the wavefunctions and energy levels in helium 
by direct calculation but looking back we can see how to anticipate the 
answer by making use of symmetry arguments. The Hamiltonian for the 
electrostatic repulsion, proportional to 1/riz = 1/|r1 — re|, commutes 
with the operator that interchanges the particle labels 1 and 2, i.e. the 
swap operation 1 4 2. (Although we shall not give this operator a 
symbol it is obvious that it leaves the value of 1/ri2 unchanged.) Com- 
muting operators have simultaneous eigenfunctions. This prompts us 
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Fig. 3.3 An illustration of degenerate 
perturbation in a classical system. (a) 
Two harmonic oscillators with the same 
oscillation frequency wo—each spring 
has a mass on one end and its other end 
is attached to a rigid support. An in- 
teraction, represented here by another 
spring that connects the masses, cou- 
ples the motions of the two masses. The 
normal modes of the system are (b) 
an in-phase oscillation at wo, in which 
the spring between the masses does not 
change length, and (c) an out-of-phase 
oscillation at a higher frequency. Ap- 
pendix A gives the equations for this 
system of two masses and three springs, 
and also for the equivalent system of 
three masses joined by two springs that 
models a triatomic molecule, e.g. car- 
bon dioxide. 


11-This can also be seen from eqn 3.15. 
The integration over 71, 6, and ¢4 
leads to a repulsive Coulomb potential 
~ e/Amegre that cancels part of the at- 
tractive potential of the nucleus, when 
r2 is greater than the values of r; where 
a1 is appreciable. 


12 For hydrogen, the solution of 
Schrodinger’s equation reproduced 
the energy levels calculated by the 
Bohr-Sommerfeld theory. However, 
wave mechanics does give more in- 
formation about hydrogen than the 
old quantum theory, e.g. it allows the 
detailed calculation of transition rates. 
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Fig. 3.4 The energy levels of the helium atom with those of hydrogen for comparison. The 1s? ground configuration is tightly 
bound. For the excited configurations of helium the 1s-electron screens the outer electron from the nuclear charge so that the 
isni configurations in helium have similar energy to the shell with principal quantum number n in hydrogen. The hydrogenic 
levels are indicated on the right. The interval between the 'L and 3L terms (equal to twice the exchange integral) is clear for 
the 1s2s, 1s2p, 1s3s, 1s3p and 1s4s configurations but it is smaller for higher n and l. 


to construct the symmetrised wavefunctions ~/A,.¢ and YW ,ace:!* In this 
basis of eigenstates it is simple to calculate the effect of the electrostatic 


repulsion. 


3.2.1 Spin eigenstates 


The electrostatic repulsion between the two electrons leads to the wave- 
functions se and Wee in the excited states of the helium atom. The 
ground state is a special case where both electrons have the same spatial 
wavefunction, so only a symmetric solution exists. We did not consider 
spin since electrostatic interactions depend on the charge of the particle, 
not their spin. Neither Hp nor H' contains any reference to the spin 
of the electrons. Spin does, however, have a profound effect on atomic 
wavefunctions. This arises from the deep connection between spin and 
the symmetry of the wavefunction of indistinguishable particles.'4 Note 
that here we are considering the total wavefunction in the systems that 
includes both the spatial part (found in the previous section) and the 
spin. Fermions have wavefunctions that are antisymmetric with respect 
to particle-label interchange, and bosons have symmetric ones. As a 
consequence of this symmetry property, fermions and bosons fill up the 
levels of a system in different ways, i.e. they obey different quantum 
statistics. 

Electrons are fermions so atoms have total wavefunctions that are 
antisymmetric with respect to permutation of the electron labels. This 
requires (ae to associate with an antisymmetric spin function Wein 
and the other way round: 


aS A A S 
~ a UD saeeteaoin or Vspace spin - 


These antisymmetrised wavefunctions that we have constructed fulfil the 
requirement of having particular symmetry with respect to the inter- 
change of indistinguishable particles. Now we shall find the spin eigen- 
functions explicitly. We use the shorthand notation where ¢ and | repre- 
sent m, = 1/2 and —1/2, respectively. Two electrons have four possible 
combinations: the three symmetric functions, 


Pe ain ze Itt) 
1 
=a {IN) +1) 
= |W), 


corresponding to S = 1 and Ms = +1,0,—1; and an antisymmetric 
function 


(3.18) 


(3.19) 


A 
spin 


1 
= Fa iit}, 


corresponding to S = 0 (with Ms = 0).° Spectroscopists label the eigen- 
states of the electrostatic interactions with the symbol 2°+!L, where S 
is the total spin and L is the total orbital angular momentum quantum 
number. The Isni configurations in helium L = I, so the allowed terms 


(3.20) 
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13For two electrons, swapping the 
particle labels twice brings us back 
to where we started, so ~(1,2) = 
+7 (2,1). Therefore the two possible 
eigenvalues are 1 for WS,ace and —1 for 


A 
Wspace* 


147 ndistinguishable means that the 
particles are identical and have the free- 
dom to exchange positions, e.g. atoms 
in a gas which obey Fermi—Dirac or 
Bose-Einstein statistics depending on 
their spin. In contrast, atoms in a 
solid can be treated as distinguishable, 
even if they are identical, because they 
have fixed positions—we could label the 
atoms 1, 2, etc. and still know which is 
which at some later time. 


15-These statements about the result 
of adding two s=1/2 angular mo- 
menta can be proved by formal angu- 
lar momentum theory. Simplified treat- 
ments describe S = 0 as having one 
electron with ‘spin-up’ and the other 
with ‘spin-down’; but both Ms = 0 
states are linear combinations of the 
states |ms1 = +1/2, ms2 = —1/2) and 
(ide = SLY, Wbcey = SEL. 
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16The letter ‘S’ appears over-used in 
this established notation but no ambi- 
guity arises in practice. The symbol S 
for the total spin is italic because this 
is a variable, whereas the symbols S for 
EL =0 and s for! = 0 are not italic. 


17This anticipates a more general dis- 
cussion of this and other selection rules 
for the LS-coupling scheme in a later 
chapter. 


are 'L and 3L, e.g. the 1s2s configuration in helium gives rise to the 
terms 'S and 3S, where S represents L = 0.'® 

In summary, we have calculated the structure of helium in two distinct 
stages. 

(1) Energies Degenerate perturbation theory gives the space wave- 
functions Dee and eee with energies split by twice the exchange 
integral. In helium the degeneracy arises because the two electrons 
are identical particles so there is exchange degeneracy, but the treat- 
ment is similar for systems where a degeneracy arises by accident. 


(2) Spin We determined the spin associated with each energy level by 
constructing symmetrised wavefunctions. The product of the spa- 
tial functions and the spin eigenstates gives the total atomic wave- 
function that must be antisymmetric with respect to particle-label 
interchange. 


Exchange degeneracy, exchange integrals, degenerate perturbation the- 
ory and symmetrised wavefunctions all occur in helium and their inter- 
relationship is not straightforward so that misconceptions abound. A 
common misinterpretation is to infer that because levels with different 
total spin, S = 0 and 1, have different energies then there is a spin- 
dependent interaction—this is not correct, but sometimes in condensed 
matter physics it is useful to pretend that it is! (See Blundell 2001.) 
The interactions that determine the gross structure of helium are en- 
tirely electrostatic and depend only on the charge and position of the 
particles. Also, degenerate perturbation theory is sometimes regarded 
as a mysterious quantum phenomenon. Appendix A gives further dis- 
cussion and shows that symmetric and antisymmetric normal modes 
occur when two classical systems, with similar energy, interact, e.g. two 
coupled oscillators. 


3.2.2 Transitions in helium 


To determine which transitions are allowed between the energy levels 
of helium we need a selection rule for spin: the total spin quantum 
number does not change in electric dipole transitions. In the matrix 
element (Wgnai|T|Yinitial) the operator r does not act on spin; therefore, 
if the Yanai and Winitiai do not have the same value of S, then their spin 
functions are orthogonal and the matrix element equals zero.!? This 
selection rule gives the transitions shown in Fig. 3.5. 
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Singlets Triplets 
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3.3 Evaluation of the integrals in helium 


In this section we shall calculate the direct and exchange integrals to 
make quantitative predictions for some of the energy levels in the helium 
atom, based on the theory described in the previous sections. This 
provides an example of the use of atomic wavefunctions to carry out a 
calculation where there are no corresponding classical orbits and gives 
an indication of the complexities that arise in systems with more than 
one electron. The evaluation of the integrals requires care and some 
further details are given in Appendix B. The important point to be 
learnt from this section, however, is not the mathematical techniques 
but rather to see that the integrals arise from the Coulomb interaction 
between electrons treated by straightforward quantum mechanics. 


3.3.1 Ground state 


To calculate the energy of the 1s? configuration we need to find the ex- 
pectation value of e7/47€9ri2 in eqn 3.1—this calculation is the same 
as the evaluation of the mutual repulsion between two charge distribu- 
tions in classical electrostatics, as in eqn 3.15 with p15(r1) and pyi(r2) = 
Pis(’2)- The integral can be considered in different ways. We could 
calculate the energy of the charge distribution of electron 1 in the po- 
tential created by electron 2, or the other way around. This section does 
neither; it uses a method that treats each electron symmetrically (as in 
Appendix B), but of course each approach gives the same numerical re- 
sult. Electron 1 produces an electrostatic potential at radial distance rg 


given by 
T? il 
Vie (r2) = i 
0 


4tenri2 


p(ri) d°r; . (3.21) 


Fig. 3.5 The allowed transitions be- 
tween the terms of helium are governed 
by the selection rule AS = 0 in addi- 
tion to the rule Al = +1 found pre- 
viously. Since there are no transitions 
between singlets and triplets it is con- 
venient to draw them as two separate 
systems. Notice that in the radiative 
decay of helium atoms excited to high- 
lying levels there are bottlenecks in the 
metastable 1s2s 1S and 1s2s °S terms. 
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18 Here Q (co) = —e. 


19 As is usual in calculations of the in- 
teraction between electric charge dis- 
tributions, one must be careful to 
avoid double counting. This method 
of calculation avoids this pitfall, as 
shown by the general argument in Ap- 
pendix B. An alternative method is 
used in Woodgate (1980), Problem 5.5. 


20The effect of the repulsion propor- 
tional to 1/rig can be considered in 
terms of potentials like that in eqn 3.21 
{and Appendix B). The potential at 
the position of the outer electron ro 
arising from the charge distribution of 
electron 1 accounts for a large por- 
tion of the total repulsion: Vi2(r2) ~ 
e?/4me€or2 in the region where pn; (r2) 
has an appreciable value. Hence it 
makes sense to include e7/4m ore in 
the zeroth-order Hamiltonian Ho, and 
treat the (small) part left over as a per- 
turbation Hj. 


The spherical symmetry of s-electrons means that the charge in the 
region r; < rg acts like a point charge at the origin, so that 


Q (r2) 


Viz (ra) a AT eor2 i 


where Q (rz) is the charge within a radius of rz from the origin, which 
is given by?® 


T2 
Q (re) = | p(ry) 4ar? dr). (3.22) 
0 
The electrostatic energy that arises from the repulsion equals 
co 
Fi2 = i Vie (r2) p (r2) Anr dro : (3.23) 
0 


For the 1s? configuration there is an exactly equal contribution to the 
energy from V2, (11), the (partial) potential at r; produced by electron 2. 
Thus the total energy of the repulsion between the electrons is twice that 
in eqn 3.23.!9 Using the radial wavefunction for a 1s-electron, we find 


iam [iad eee P 
ier =x ras / SAG eae 80) 2m an} 4Z3@~(2/00)2r 292 dpy 
TE Jo 0 11 


_ e7/Ar€9 5 


3.24 
2ao 4 ( ) 


Z = (13.6eV) x a. 
For helium this gives JZ7” = 34 eV. 


3.3.2 Excited states: the direct integral 


A isnl configuration of helium has an energy close to that of an nl- 
electron in hydrogen, e.g. in the 1s2p configuration the 2p-electron has 
a similar binding energy to the n = 2 shell of hydrogen. The obvious 
explanation, in Bohr’s model, is that the 2p-electron lies outside the 1s- 
orbit so that the inner electron screens the outer one from the full nuclear 
charge. Applying an analogous argument to the quantum treatment of 
helium leads to the Hamiltonian H = Ho, + H!, where® 


lie e? 2 1 
Hoa = ee 2) - —+— : 
(a = ee ( ve ) (3.25) 


pee (—-=). 
ATreQ (END (PS) 


In the expression for Ho,, electron 2 experiences the Coulomb attraction 
of a charge +le. In H/ the subtraction of e?/4regr2 from the mutual 
repulsion means that the perturbation tends to zero at a large distance 
from the nucleus (which is intuitively reasonable). This decomposition 
differs from that in Section 3.1. The different treatment of the two 
electrons makes the perturbation theory a little tricky, but Heisenberg 


and 


(3.26) 
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did the calculation as described in Bethe and Salpeter (1957) or Bethe 
and Salpeter oe he found the direct integral 


Jian = om | / (= -=) fag(1)? |ttmim(2)2 d2x; d2ry. (3.27) 


This must be evaluated with the appropriate wavefunctions, i.e. uZ7! 
rather than uZ—?, and uZ~? as before.”! For the excited electron Untm = 
Rrit(r) Yim (9, ¢), where R,:(r) is the radial function for Z = 1. We write 
the direct integral as 


co co 
ye = [ / J(r1,72) Rig(ri)Rey(r2)ri dri rg dre, (3.28) 
0 0 


4nr €9 


where the angular parts are contained in the function”? 


I(r1,12) =| 7 : / 7 (= - ~) i ila 00) (3.29) 


x sin 6; dé; dd; sin 62 dO2 d¢o . 


The calculation of this integral requires the expansion of 1/rj2 in terms 
of spherical harmonics:2? 


los) k 
a) (2) = 4 2 ¥%q (01,41) Yang (82,42) (3.30) 
0 


ik r tf 
12 2 = 2 


for ro > r; (and ry © rg when r; > rg). Only the term for k = 0 
survives in the integration over angles in eqn 3.29 to give?* 


for 71 < Te, 
l/r; —lfre forr, >re. 


When r; < rz the original screening argument applies and eqn 3.25 
gives a good description. When r; > rg the appropriate potential is 
proportional to —2/r2g —1/r; and J(r1,r2) accounts for the difference 
between this and —2/r; — 1/r2 used in Hog. Thus we find 


op ad ia ee R2(r1)r? dry > R2,(r2)r5 dr 
l= Ae i = m1 12 ONY Us eal PEI Y Se 


J(r1,72) = 


(3.31) 
Evaluation of this integral for the 1s2p configuration (in Exercise 3.6) 
gives Jisop = —2.8 x 10~*eV—three orders of magnitude smaller than 


JZ>? in eqn 3.7 (evaluated from eqn 3.24). The unperturbed wavefunc- 
tion for Z = 1 has energy equal to that of the corresponding level in 
hydrogen and the small negative direct integral accounts for the incom- 
pleteness of the screening of the nl-electron by the inner electron. 


3.3.3. Excited states: the exchange integral 


The exchange integral ea the same form as eqn 3.16 but with uZ7! 


rather than uZ? (and uf,~? as before). Within the spatial wavefunction 


?1 We have not derived this integral rig- 
orously but it has an intuitively reason- 
able form. 


2299 (61,61) = 1/V4m. 


*8y/ | (81501) = (—1)* Yi,—q (81,41). 


24 When k # 0 the integral of the func- 
tion re (91,61) over 6 and ¢; equals 
zero. 
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25.4¢ small r the wavefunction of an 
ns-electron deviates significantly from 
uZ="; for this reason 1s2p was chosen 
as an eae above. 


6 The overlap of the Is and nl wave- 
functions becomes smaller as n and 
i increase. In Heisenberg’s treatment 
where screening is taken into account, 
the direct integral gives the deviation 
from the hydrogenic levels (which could 
be characterised by a quantum de- 
fect as in the alkalis, see Chapter 4). 
For electrons with | # O the term 
—h?1(1+ 1) /2mr? in the Schrédinger 
equation causes the electron’s wave- 
function to lie almost entirely outside 
the region where uZ>? = Ris (r) //4a 
has a significant ane. 


Unim = Rni(r)Yim(@,¢) only the radial part depends on Z. We write 
the exchange integral as (cf. eqn 3.28) 


Kisnt— a ffx K (ry, 72) Ris(t1)Rni(11) Ris(r2) Rat (r2)r} dri re dro. 
(3.32) 


The function K(r1,r2) containing the angular integrals is (cf. eqn 3.29) 


K(ri,r2) = If — Yi (01, 81) g-Vim (02, 2) 


x sin A; dé, dd; sin 65 dé, d¢d2 5 


rae 


(3.33) 


For the 1lsnp configuration only the second term of the expansion in 
eqn 3.30, with k = 1, survives in the integration because of the orthog- 
onality of the spherical harmonic functions (see Exercise 3.7), to give 


r/3re 
r2/3r? 


for Tr; < 12, 


K(ri,7r2) = (3.34) 


for rg <1). 


Carrying out the integration over the radial wavefunctions in eqn 3.32 for 
the 1s2p configuration gives the splitting between °P and 'P as 2Kisop ~ 
0.21eV (close to the measured value of 0.25 eV). 

The assumption that the excited electron lies outside the ls wave- 
function does not work so well for 1sns configurations since ws (0) has 
a finite value and the above method of calculating J and K is less 
accurate.2° The 1s2s configuration of helium has a singlet-triplet sepa- 
ration of E (1S) — E (3S) = 2K 1525 ~ 0.80eV and the direct integral is 
also larger than that for 1s2p—these trends are evident in Fig. 3.4 (see 
also Exercise 3.7).7° 

In some respects, helium is a more typical atom than hydrogen. The 
Schrodinger and Dirac equations can be solved exactly for the one- 
electron system, but not for helium or other atoms with more electrons. 
Thus in a careful study of helium we encounter the approximations 
needed to treat multi-electron atoms, and this is very important for 
understanding atomic structure in general. Helium also gives a good 
example of the influence of identical particles on the occupation of the 
states in quantum systems. The energy levels of the helium atom (and 
the existence of exchange integrals) do not depend on the fact that the 
two electrons are identical, as demonstrated in Exercises 3.3 and 3.4; 
however, this is a common point of confusion. The books recommended 
for further reading give clear and accurate descriptions of helium that 
reward careful study. 


Further reading 


The recommended books are divided into two categories corresponding 
to the two main themes in this chapter: (a) a description of how to 
calculate the electrostatic energy in an atom with more than one elec- 
tron, which introduces principles that can be used in atoms with more 
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electrons; and (b) a discussion of the influence of identical particles on 
the statistics of a quantum system that is important throughout physics. 
The influence of identical particles on the occupation of the quantum lev- 
els of a system with many particles, i.e. Bose-Einstein and Fermi—Dirac 
statistics, is discussed in statistical mechanics texts. Clear descriptions 
of helium may be found in the following textbooks: Cohen-Tannoudji et 
al. (1977), Woodgate (1980) and Mandl (1992). The calculation of the 
direct and exchange integrals in Section 3.3 is based on the definitive 
work by Bethe and Salpeter (1957), or see Bethe and Jackiw (1986). 

A very instructive comparison can be made between the properties of 
the two electrons in helium and the nuclear spin statistics of homonu- 
clear diatomic molecules?’ described in Atkins (1983, 1994).28 There ?7Molecules made up of two atoms with 
are diatomic molecules with nuclei that are identical bosons, identical identical nuclei. 
fermions and cases of two similar but not identical particles, and their 7®These books also summarise the he- 
study gives a wider perspective than the study of helium alone. The oe 
nuclei of the two atoms in a hydrogen molecule are protons which are  ciosely related to ee " 
fermions (like the two electrons in helium).?° For reasons explained in ree ee 
the above references, we can consider only those parts of the molecular joiecule has exchange va 
wavefunction that describe the rotation wo, and the nuclear spin states crudely speaking, the molecule looks 
wy —these are spatial and spin wavefunctions, respectively. For Hz the the same when rotated through 180°. 
wavefunction must have overall antisymmetry with respect to an inter- 
change of particle labels since the nuclei are protons, each with a spin 
of 1/2. This requires that a rotational must pair with a spin function of 
the opposite symmetry: 


Wmolecule = weve or pa wr : (3.35) 


This is analogous to eqn 3.18 for helium; as described in Section 3.2.1, 
the two spin-1/2 nuclei in a hydrogen molecule give a total (nuclear) 
spin of 0 and 1, with one state and three states, respectively. The 1 to 3 
ratio of the number of nuclear spins associated with the energy levels for 
ws, and ~4,, respectively, influences the populations of these rotational 
energy levels in a way that is directly observed in molecular spectra (the 
intensity of the lines in spectra depends on the population of the initial 
level). The molecule HD made from hydrogen and deuterium does not 
have identical nuclei so there is no overall symmetry requirement, but 
it has similar energy levels to those of Hz apart from the mass depen- 
dence. This gives a real physical example where the statistics depends 
on whether the particles are identical or not, but the energy of the sys- 
tem does not. Exercise 3.4 discusses an artificial example: a helium-like 
system that has the same energy levels as a helium atom and hence the 
same direct and exchange integrals, even though the constituent parti- 
cles are not identical. 
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Exercises 


More advanced problems are indicated by a *. 


(3.1) 


(3.2) 


(3.3) 


Estimate of the binding energy of helium 


(a) Write down the Schrédinger equation for the 
helium atom and state the physical significance 
of each of the terms. 


Estimate the equilibrium energy of an electron 
bound to a charge +Ze by minimising 


(b) 


Bee 


4neor 


h2 


2mr2 


E(r)= 


(c) Calculate the repulsive energy between the two 
electrons in helium assuming that rig ~ r. 


Hence estimate the ionization energy of helium. 


Estimate the energy required to remove a fur- 
ther electron from the helium-like ion Si)?*, 
taking into account the scaling with Z of the 
energy levels and the expectation value for 
the electrostatic repulsion. The experimen- 
tal value is 2400eV. Compare the accuracy of 
your estimates for Si!?* and helium. (IE(He) 
= 24.6eV.) 


Direct and exchange integrals for an arbitrary 
system 
(a) Verify that for 


w* (r1,72) 
= {ua (r1)ug (r2) — Ua (r2)ug (ri) } 


and H’ = e*/4meori2 the expectation value 
(p*| lah |p”) has the form J — K and give the 
expressions for J and K. 

(b) Write down the wavefunction w* that is orthog- 
onal to wi. 

(c) Verify that (“| H'|y*) = 0 so that H’ is di- 
agonal in this basis. 


Exchange integrals for a delta-function interaction 

A particle in a square-well potential, with V(x) = 0 

for 0 < x < & and V(x) = oo elsewhere, has 

normalised eigenfunctions uo(x) = \/2/£sin (2/ £) 

and u(x) = /2/ésin (202/ £). 

(a) What are the eigenenergies Eo and FE, of these 
two wavefunctions for a particle of mass m? 


(3.4) 


-—_ 


(b) Two particles of the same mass m are both in 
the ground state so that the energy of the whole 
system is 2Eo. Calculate the perturbation pro- 
duced by a point-like interaction described by 
the potential a6 (z1 — #2), with a constant. 


Show that, when the two interacting particles 
occupy the ground and first excited states, the 
direct and exchange integrals are equal. Also 
show that the delta-function interaction pro- 
duces no energy shift for the antisymmetric spa- 
tial wavefunction and explain this in terms of 
correlation of the particles. Calculate the en- 
ergy of the other level of the perturbed system. 


(c) 


(d) For the two energy levels found in part (c), 
sketch the spatial wavefunction as a function 
of the coordinates of the two particles x; and 
x2. The particles move in one dimension but 
the two-particle wavefunction exists in a two- 
dimensional Hilbert space—draw either a con- 
tour plot in the 7; x2-plane or attempt a three- 


dimensional sketch (by hand or computer). 


(e) The two particles are identical and have spin 
1/2. What is the total spin quantum number S$ 
associated with each of the energy levels found 


in part (c)? 


Discuss qualitatively the energy levels of this 
system for two particles that have slightly dif- 
ferent masses mi # Mz, so that they are distin- 
guishable? [Hint. The spin has not been given 
because it is not important for non-identical 
particles. | 


Comment. The antisymmetric spatial wavefunction 
in part (c) clearly has different properties from a 
straightforward product uoui. The exchange inte- 
gral is a manifestation of the entanglement of the 
multiple-particle system. 


A helium-like system with non-identical particles 

Imagine that there exists an exotic particle with the 
same mass and charge as the electron but spin 3/2 
(so it is not identical to the electron). This par- 
ticle and an electron form a bound system with a 
helium nucleus. Compare the energy levels of this 
system with those of the helium atom. Describe 
the energy levels of a system with two of the ex- 


(3.5) 


(3.6) 


(3.7) 


otic particles bound to a helium nucleus (and no 
electrons). [Hint. It is not necessary to specify the 
values of total spin associated with the levels.] 


The integrals in helium 

(a) Show that the integral in eqn 3.24 gives the 
value stated in eqn 3.7. 

(b) Estimate the ground-state energy of helium us- 
ing the variational principle. (The details of 
this technique are not given in this book; see 
the section on further reading.) 

Calculation of integrals for the 1s2p configuration 

(a) Draw a scale diagram of RZ-?(r), R¥%-! (r) 
and RZ; (r). (See Table 2.2.) 

(b) Calculate the direct integral in eqn 3.31 and 
show that it gives 


_e'/4neg 13 
2ag 2x 55° 


Chie = 


Give the numerical value in eV (cf. that given 
in the text). 
Expansion of 1/rie 
For r; < r2 the binomial expansion of 
1 


—= = (ri + re — 2rir2cos 612) 
Law: 


1/2 
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igo aa 
— 1 {1-28 conte + (2) 
712 T2 r2 r2 


© {14 Doostia+...}, 
T2 


I? 


; (3.36) 


(When r; > r2 we must interchange r; and r2 to ob- 
tain convergence.) The cosine of the angle between 
r; and ro is 


cos 612 =T1 ‘Te 


= cos 4; cos 02 + sin 4; sin 62 cos (fi — ¢2). 


(a) Show that the first two terms in the binomial 
expansion agree with the terms with k = 0 and 
1 in eqn 3.30. 


(b) The repulsion between a 1s- and an nl-electron 
is independent of m. Explain why, physically 
or mathematically. 


(c) Show that eqn 3.32 leads to eqn 3.34 for 1 = 1. 


(d) For a 1snl configuration, the quantity K(r1,r2) 
in eqn 3.34 is proportional to rj /rj+! when 
T1 < r2. Explain this in terms of mathemat- 
ical properties of the Y;,, functions. 
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1 Also referred to by its original German 
name as the Aufbauprinciple. An ex- 
tensive discussion of the atomic struc- 
ture that underlies the periodic table 
can be found in chemistry texts such as 
Atkins (1994), 


2 Most of the arrangement of elements 
in a periodic table was determined by 
chemists, such as Mendeleev, in the 
nineteenth century. A few inconsisten- 
cies in the ordering were resolved by 
Moseley’s measurements of X-ray spec- 
tra (see Chapter 1). 


3The configuration of an atom is spec- 
ified by a list of ni with the occupancy 
as an exponent. Generally, we do not 
need to list the full configuration and it 
is sufficient to say that a sodium atom 
in its ground state has the configura- 
tion 3s. A sodium ‘atom’ with one elec- 
tron in the 3s level, and no others, is 
an excited state of the highly-charged 
ion Nat!°_+this esoteric system can be 
produced in the laboratory but confu- 
sion with the common sodium atom is 
unlikely. 


The alkalis 


4.1 Shell structure and the periodic table 


For multi-electron atoms we cannot solve the Hamiltonian analytically, 
but by making appropriate approximations we can explain their struc- 
ture in a physically meaningful way. To do this, we start by considering 
the elementary ideas of atomic structure underlying the periodic table 
of the elements. In the ground states of atoms the electrons have the 
configuration that minimises the energy of the whole system. The elec- 
trons do not all fall down into the lowest orbital with n = 1 (the K-shell) 
because the Pauli exclusion principle restricts the number of electrons 
in a given (sub-)shell—two electrons cannot have the same set of quan- 
tum numbers. This leads to the ‘building-up’ principle: electrons fill up 
higher and higher shells as the atomic number Z increases across the 
periodic table.! Full shells are found at atomic numbers Z = 2,10,... 
corresponding to helium and the other inert gases. These inert gases, in 
a column on the right-hand side of the periodic table (see inside front 
cover), were originally grouped together because of their similar chemical 
properties, i.e. the difficulty in removing an electron from closed shells 
means that they do not readily undergo chemical reactions.” However, 
inert gas atoms can be excited to higher-lying configurations by bom- 
bardment with electrons in a gas discharge, and such processes are very 
important in atomic and laser physics, as in the helium—neon laser. 

The ground states of the alkalis have the following electronic con- 
figurations: 


lithium Li 
Na 
potassium kK 
rubidium Rb 
caesium Cs 


1e%25%, 

1s*25720° 35, 

1s?2s?2p°3s73p° 4s, 
1s?2s?2p°3s23p°3d104s74p 53, 
1s?72s?2p*3s73p°3d194s74p%4d!95s25p® 6s. 


sodium 


The alert reader will notice that the sub-shells of the heavier alkalis 
are not filled in the same order as the hydrogenic energy levels, e.g. elec- 
trons occupy the 4s level in potassium before the 3d level (for reasons 
that emerge later in this chapter). Thus, strictly speaking, we should 
say that the inert gases have full sub-shells, e.g. argon has the electronic 


Table 4.1 Ionization energies of the inert gases and alkalis. 


Element Z _  IE(eV) 
He 2 24.6 
Li 3 5.4 
Ne 10 21.6 
Na iit @el 
Ar 18 15.8 
K 19 4.3 
Kr 36 14.0 
Rb 37 «4.2 
Xe 54 iyi 


configuration 1s?2s?2p°3s?3p° with the 3d sub-shell unoccupied.‘ 

Each alkali metal comes next to an inert gas in the periodic table 
and much of the chemistry of the alkalis can be explained by the simple 
picture of their atoms as having a single unpaired electron outside a core 
of closed electronic sub-shells surrounding the nucleus. The unpaired 
valence electron determines the chemical bonding properties; since it 
takes less energy to remove this outer electron than to pull an electron 
out of a closed sub-shell (see Table 4.1), thus the alkalis can form singly- 
charged positive ions and are chemically reactive.° However, we need 
more than this simple picture to explain the details of the spectra of the 
alkalis and in the following we shall consider the wavefunctions. 


4.2 The quantum defect 


The energy of an electron in the potential proportional to 1/r depends 
only on its principal quantum number n, e.g. in hydrogen the 3s, 3p and 
3d configurations all have the same gross energy. These three levels are 
not degenerate in sodium, or any atom with more than one electron, 
and this section explains why. Figure 4.1 shows the probability density 
of 3s-, 3p- and 3d-electrons in sodium. The wavefunctions in sodium 
have a similar shape (number of nodes) to those in hydrogen. The 3d 
wavefunction has a single lobe outside the core so that it experiences 
almost the same potential as in a hydrogen atom; therefore this electron, 
and other d configurations in sodium with n > 3, have binding energies 
similar to those in hydrogen, as shown in Fig. 4.2. In contrast, the 
wavefunctions for the s-electrons have a significant value at small r— 
they penetrate inside the core and ‘see’ more of the nuclear charge. Thus 
the screening of the nuclear charge by the other electrons in the atom is 
less effective for ns configurations than for nd, and s-electrons have lower 
energy than d-electrons with the same principal quantum number. (The 
np-electrons lie between these two.®) The following modified form of 
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“This book takes a shell to be all energy 
levels of the same principal quantum 
number n, but the meaning of shell and 
sub-shell may be different elsewhere. 
We use sub-shell to denote all energy 
levels with specific values of n and | 
(in a shell with a given value of n). 
We used these definitions in Chapter 1; 
the inner atomic electrons involved in 
X-ray transitions follow the hydrogenic 
ordering. 


For a plot of the ionization energies of 
all the elements see Grant and Phillips 
(2001, Chapter 11, Fig. 18). This 
figure is accessible at http://www. 
oup.co.uk/best.textbooks/physics/ 
ephys/illustrations/. 


6 This dependence of the energy on the 
quantum number / can also be ex- 
plained in terms of the elliptical or- 
bits of Bohr-Sommerfeld quantum the- 
ory rather than Schrodinger’s wave- 
functions; however, we shall use only 
the ‘proper’ wavefunction description 
since the detailed correspondence be- 
tween the elliptical classical trajectories 
and the radial wavefunctions can lead 
to confusion. 
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Fig. 4.1 The probability density of the 
electrons in a sodium atom as a func- 
tion of r. The electrons in the n = 1 
and n = 2 shells make up the core, 
and the probability density of the un- 
paired outer electron is shown for the 
nm = 3 shell with 1 = 1, 2 and 3. The 
probability is proportional to |P(r)|? = 
r?|R(r)|?; the r? factor accounts for the 
increase in volume of the spherical shell 
between r and r+dr (i.e. 4ar? dr) as 
the radial distance increases. The de- 
creasing penetration of the core as | 
increases can be seen clearly—the 3d- 
electron lies mostly outside the core 
with a wavefunction and binding en- 
ergy very similar to those for the 3d 
configuration in hydrogen. These wave- 
functions could be calculated by the 
simple numerical method described in 
Exercise 4.10, making the ‘frozen core’ 
approximation, i.e. that the distribu- 
tion of the electrons in the core is 
not affected by the outer electron—this 
gives sufficient accuracy to illustrate 
the qualitative features. (The iterative 
method described in Section 4.4 could 
be used to obtain more accurate numer- 
ical wavefunctions.) 


’This differs from the modification 
used for X-ray transitions in Chapter 
1—hardly surprising since the physical 
situation is completely different for the 
inner and outer electrons. 


(a) 


c 0! a Te-25- 20° 


(b) 


Core\ 1s”2s°2p® 


Bohr’s formula works amazingly well for the energy levels of the alkalis: 


Roo 


E (n,l) = ae ees : 


(4.1) 


A quantity 4), called the quantum defect, is subtracted from the prin- 
cipal quantum number to give an effective principal quantum number 
n* =n — 6." The values of the quantum defects for each | can be esti- 
mated by inspecting the energy levels shown in Fig. 4.2. The d-electrons 
have a very small quantum defect, dq ~ 0, since their energies are nearly 
hydrogenic. We can see that the 3p configuration in sodium has com- 
parable energy to the n = 2 shell in hydrogen, and similarly for 4p and 
n = 3, etc.; thus dp ~ 1. It is also clear that the quantum defect for 
s-electrons is greater than that for p-electrons. A more detailed analysis 
shows that all the energy levels of sodium can be parametrised by the 
above formula and only three quantum defects: 


Sodium es! 
ay ns np nd nf oon 


— a 4 
=] =. 
, & — 3 
24 _ 
4 = 
2? 


age! 


bs = 1.35, 6)=0.86, da=0.01, 6 ~0.00 for 1>2. 


There is a small variation with n (see Exercise 4.3). Having examined 
the variation in the quantum defects with orbital angular momentum 
quantum number for a given element, now let us compare the quantum 
defects in different alkalis. The data in Table 4.1 show that the alkalis 
have similar ionization energies despite the variation in atomic number. 
Thus the effective principal quantum numbers n* = (13.6 eV /IE)!/ 
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Fig. 4.2 The energies of the s, p, d and 
f configurations in sodium. The energy 
levels of hydrogen are marked on the 
right for comparison. The guidelines 
link configurations with the same n to 
show how the energies become closer to 
the hydrogenic values as / increases, i.e. 
the quantum defects decrease so that 
6, ~ 0 for f-electrons (and for the con- 
figurations with 1 > 3 that have not 
been drawn). 
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8 For example, lithium has three inter- 
actions between the three electrons, in- 
versely proportional to rj2, 713 and 723; 
summing over all 7 for each value of 2 
would give six terms. 


Table 4.2 The effective principal quantum numbers and quantum defects for the 
ground configuration of the alkalis. Note that the quantum defects do depend slightly 
on n (see Exercise 4.3), so the value given in this table for the 3s-electron in sodium 
differs slightly from the value given in the text (ds = 1.35) that applies for n > 5. 


Element Configuration n* ds 
Li 2s 1.59 0.41 
Na 3s 163 1.37 
K 4s Wetted} 
Rb 5S 1.81 3.19 


Cs 6s 1S i reals 


(from eqn 4.1) are remarkably similar for all the ground configurations 
of the alkalis, as shown in Table 4.2. 

In potassium the lowering of the energy for the s-electrons leads to the 
4s sub-shell filling before 3d. By caesium (spelt cesium in the US) the 
6s configuration has lower energy than 4f (d¢ ~ 0 for Cs). The exercises 
give other examples, and quantum defects are tabulated in Kuhn (1969) 
and Woodgate (1980), amongst others. 


4.3. The central-field approximation 


The previous section showed that the modification of Bohr’s formula by 
the quantum defects gives reasonably accurate values for the energies 
of the levels in alkalis. We described an alkali metal atom as a single 
electron orbiting around a core with a net charge of +1e, i.e. the nucleus 
surrounded by N — 1 electrons. This is a top-down approach where we 
consider just the energy required to remove the valence electron from 
the rest of the atom; this binding energy is equivalent to the ionization 
energy of the atom. In this section we start from the bottom up and 
consider the energy of all the electrons. The Hamiltonian for N electrons 
in the Coulomb potential of a charge + Ze is 


N N 
h? Ze?/Areo e7/Areg 
Hp Jy? . Se ee 4.2 
2d 2m Ti » lig Ce) 


The first two terms are the kinetic energy and potential energy for each 
electron in the Coulomb field of a nucleus of charge Z. The term with 
rij = |r; —1;| in the denominator is the electrostatic repulsion between 
the two electrons at r; and r;. The sum is taken over all electrons with 
j > i to avoid double counting.® This electrostatic repulsion is too large 
to be treated as a perturbation; indeed, at large distances the repulsion 
cancels out most of the attraction to the nucleus. To proceed further 
we make the physically reasonable assumption that a large part of the 
repulsion between the electrons can be treated as a central potential 
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S(r). This follows because the closed sub-shells within the core have 
a spherical charge distribution, and therefore the interactions between 
the different shells and between shells and the valence electron are also 
spherically symmetric. In this central-field approximation the total po- 
tential energy depends only on the radial coordinate: 


Vor (r) = ee + S(r). (4.3) 
In this approximation the Hamiltonian becomes 
N Re, 
Her = {a} + Vor he (4.4) 


i=1 
For this form of potential, the Schrédinger equation for N electrons, 
Hw = Eatomy, can be separated into N one-electron equations, i.e. 
writing the total wavefunction as a product of single-electron wavefunc- 
tions, namely 


Yatom = Yitey3-:- Yn, (4.5) 


leads to N equations of the form 


2 
{373 + Vor ry} 1 = Fit, (4.6) 


and similar for electrons i = 2 to N. This assumes that all the elec- 
trons see the same potential, which is not as obvious as it may appear. 
This symmetric wavefunction is useful to start with (cf. the treatment 
of helium before including the effects of exchange symmetry); however, 
we know that the overall wavefunction for electrons, including spin, 
should be antisymmetric with respect to an interchange of the particle 
labels. (Proper antisymmetric wavefunctions are used in the Hartree— 
Fock method mentioned later in this chapter.) The total energy of the 
system is atom = &, + Eo +...+ En. The Schrédinger equations for 
each electron (eqn 4.6) can be separated into parts to give wavefunctions 
of the form %1 = R(r1)¥i,,m,¥spin(1). Angular momentum is conserved 
in a central field and the angular equation gives the standard orbital 
angular momentum wavefunctions, as in hydrogen. In the radial equa- 
tion, however, we have Vor(r) rather than a potential proportional to 
1/r and so the equation for P(r) = rR (r) is 
2 Ad 2 

\-3043 + Vor (0) + 0S po) = EPO). 4 
To solve this equation we need to know the form of Vor(r) and compute 
the wavefunctions numerically. However, we can learn a lot about the 
behaviour of the system by thinking about the form of the solutions, 
without actually getting bogged down in the technicalities of solving the 
equations. At small distances the electrons experience the full nuclear 
charge so that the central electric field is 


Ze ~ 
——-fr 
Ateor? 


E(r) > (4.8) 
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°This is not necessarily the best way to 
parametrise the problem for numerical 
calculations but it is useful for under- 
standing the underlying physical prin- 
ciples. 


Fig. 4.4 The form of the potential en- 
ergy of an electron in the central-field 
approximation (e2, = e?/4meo). This 
approximate sketch for a sodium atom 
shows that the potential energy crosses 
over from Vor(r) = —e2,/r at long 
range to —1le?,/r + Voftset; the con- 
stant Vose, comes from the integration 
in eqn 4.11 (if Zeg¢(r) = 11 for all r then 
Voffset = 0 but this is not the case). For 
electrons with | > 0 the effective poten- 
tial should also include the term that 
arises from the angular momentum, as 
shown in Fig. 4.5. 


r 


Fig. 4.3 The change-over from the short- to the long-range is not calculated but is 
drawn to be a reasonable guess, using the following criteria. The typical radius of 
the 1s wavefunction around the nucleus of charge +Ze = +1le is about ao/11, and 
so Zeg will start to drop at this distance. We know that Z.q_ ~ 1 at the distance 
at which the 3d wavefunction has appreciable probability since that eigenstate has 
nearly the same energy as in hydrogen. The form of the function Ze (r) can be 
found quantitatively by the Thomas—Fermi method described in Woodgate (1980). 


At large distances the other N — 1 electrons screen most of the nuclear 
charge so that the field is equivalent to that of charge +le: 


Es 


BC (4.9) 


——r. 
Arregr? 
These two limits can be incorporated in a central field of the form 


Zoffe ae 


Ecr(r) > (4.10) 


Anegr? — 
The effective atomic number Zeg (r) has limiting values of Ze(0) = Z 
and Zeg (r) -> 1 as r — oo, as sketched in Fig. 4.3.2 The potential 
energy of an electron in the central field is obtained by integrating from 
infinity: 


Vor(r) =e i [Ecr(r’)| dr’ . (4.11) 


The form of this potential is shown in Fig. 4.4. 
So far, in our discussion of the sodium atom in terms of the wave- 
function of the valence electron in a central field we have neglected 


—tte?, + constant 
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A> U(L+1) 


2mr 
7 A7UI+1) 
Vor (r) 2 2mr? 


the fact that the central field itself depends on the configuration of the 
electrons in the atom. For a more accurate description we must take 
into account the effect of the outer electron on the other electrons, and 
hence on the central field. The energy of the whole atom is the sum of 
the energies of the individual electrons (in eqn 4.6), e.g. asodium atom in 
the 3s configuration has energy E (1s?2s?2p® 3s) = 2E\,+2E25+6E2op + 
F3, = Ecore + £35. This is the energy of the neutral atom relative to the 
bare nucleus (Na!*).!° It is more useful to measure the binding energy 
relative to the singly-charged ion (Nat) with energy E (1s*2s*2p*) = 
2E,+2E5,+6E5, = Evore- The dashes are significant—the ten electrons 
in the ion and the ten electrons in the core of the atom have slightly 
different binding energies because the central field is not the same in 
the two cases. The ionization energy is IE = Eatom — Ejon = (Ecore — 
El ore) + E3s. From the viewpoint of valence electrons, the difference 
in Exore between the neutral atom and the ion can be attributed to 
core polarization, i.e. a change in the distribution of charge in the core 
produced by the valence electron.!! To calculate the energy of multi- 


Fig. 4.5 The total potential in the 


central-field approximation including 
the term that is proportional to I{l + 
1)/r? drawn here for | = 2 and the 
same approximate electrostatic Vor(r) 
as shown in Fig. 4.4. The angular 
momentum leads to a ‘centrifugal bar- 
rier’ that tends to keep the wavefunc- 
tions of electrons with | > 0 away from 
r = 0 where the central-field potential 


is deepest. 


10-This is a crude approximation, espe- 
cially for inner electrons. 


‘1 Phis effect is small in the alkalis and 
it is reasonable to use the frozen core 
approximation that assumes Ecore ~ 
Etore- This approximation becomes 
more accurate for a valence electron in 
higher levels where the influence on the 
core becomes smaller. 
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12This is in contrast to an infinite 
square well where confinement to a re- 
gion of fixed dimensions gives energies 
proportional to n”, where n is an inte- 
ger. 


electron atoms properly we should consider the energy of the whole 
system rather than focusing attention oh only the valence electron. For 
example, neon has the ground configuration 1s?2s*2p® and the electric 
field changes significantly when an electron is excited out of the 2p sub- 
shell, e.g. into the 1s?2s?2p°3s configuration. 

Quantum defects can be considered simply as empirical quantities that 
happen to give a good way of parametrising the energies of the alkalis but 
there is a physical reason for the form of eqn 4.1. In any potential that 
tends to 1/r at long range the levels of bound states bunch together 
as the energy increases—at the top of the well the classically allowed 
region gets larger and so the intervals between the eigenenergies and the 
stationary solutions get smaller.!? More quantitatively, in Exercise 1.12 
it was shown, using the correspondence principle, that such a potential 
has energies E «x 1/k?, with Ak = 1 between energy levels, but k 
is not itself necessarily an integer. For the special case of a potential 
proportional to 1/r for all distances, k is an integer that we call the 
principal quantum number n and the lowest energy level turns out to 
be n = 1. For a general potential in the central-field approximation we 
have seen that it is convenient to write k in terms of the integer n as 
k =n-—0, where 6 is a non-integer (quantum defect). To find the actual 
energy levels of an alkali and hence 6 (for a given value of /) requires the 
numerical calculation of the wavefunctions, as outlined in the following 
section. 


4.4 Numerical solution of the Schrodinger 
equation 


Before describing particular methods of solution, let us look at the gen- 
eral features of the wavefunction for particles in potential wells. The 
radial equation for P(r) has the form 


ele 


a = -F{e-Vin}P, (4.12) 


where the potential V (r) includes the angular momentum term in eqn 
4.7. Classically, the particle is confined to the region where E—V (r) > 0 
since the kinetic energy must be positive. The positions where E = V (r) 
are the classical turning points where the particle instantaneously comes 
to rest, cf. at the ends of the swing of a pendulum. The quantum 
wavefunctions are oscillatory in the classically allowed region, with the 
curvature and number of nodes both increasing as E — V (r) increases, 
as shown in Fig. 4.6. The wavefunctions penetrate some way into the 
classically forbidden region where E — V (r) < 0; but in this region the 
solutions decay exponentially and the probability falls off rapidly. 

How can we find P (r) in eqn 4.12 without knowing the potential V(r)? 
The answer is firstly to find the wavefunctions for a potential Vor(r) 
that is ‘a reasonable guess’, consistent with eqn 4.11 and the limits on 
the central electric field in the previous equations. Then, secondly, we 
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Arbitrary 
units 


make the assumed potential correspond closely to the real potential, as 
described in the next section. Equation 4.12 is a second-order differen- 
tial equation and we can numerically calculate P(r), the value of the 
function at r, from two nearby values, e.g. u(r — 6r) and u(r — 26r).'8 
Thus, working from near r = 0, the method gives the numerical value of 
the function at all points going out as far as is necessary. The region of 
the calculation needs to extend beyond the classical turning point(s) by 
an amount that depends on the energy of the wavefunction being calcu- 
lated. These general features are clearly seen in the plots produced in 
Exercise 4.10. Actually, that exercise describes a method of finding the 
radial wavefunction R(r) rather than P(r) = rR(r) but similar princi- 
ples apply.'* If you carry out the exercise you will find that the behaviour 
at large r depends very sensitively on the energy E—the wavefunction 
diverges if FE’ is not an eigenenergy of the potential—this gives a way 
of searching for those eigenenergies. If the wavefunction diverges up- 
wards for E’ and downwards for E” then we know that an eigenenergy 
of the system E; lies between these two values, E’ < Ex, < E". Test- 
ing further values between these upper and lower bounds narrows the 
range and gives a more precise value of E, (as in the Newton—Raphson 
method for finding roots). This so-called ‘shooting’ method is the least 
sophisticated method of computing wavefunctions and energies, but it 
is adequate for illustrating the principles of such calculations. Results 
are not given here since they can readily be calculated—the reader is 
strongly encouraged to implement the numerical method of solution, us- 
ing a spreadsheet program, as described in Exercise 4.10. This shows 
how to find the wavefunctions for an electron in an arbitrary potential 
and verifies that the energy levels obey a quantum defect formula such 
as eqn 4.1 in any potential that is proportional to 1/r at long range (see 
Fig. 4.7). 


Fig. 4.6 The potential in the central- 
field approximation including the term 
that is proportional to I(1 + 1)/r? is 
drawn here for 1 = 2 and the same 
approximate electrostatic Vor(r) as 
shown in Fig. 4.4. The function P(r) = 
rR(r) was drawn for n = 6 and l = 2 
using the method described in Exer- 
cise 4.10. 


The step size ér must be small com- 
pared to the distance over which the 
wavefunction varies; but the number of 
steps must not be so large that round- 
off errors begin to dominate. 


147 a numerical method there is no 
reason why we should not calculate the 
wavefunction directly; P(r) was intro- 
duced to make the equations neater in 
the analytical approach. 
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Fig. 4.7 Simple modifications of the 
potential energy that could be used 
for the numerical solution of the 
Schrédinger equation described in Ex- 
ercise 4.10. For all these potentials 
V(r) = —e?/4neor for r > Treore- (a) 
Inside the radial distance rcore the po- 
tential energy is V(r) = —Ze?/4meor + 
Voftset, Arawn here for Z = 3 and an 
offset chosen so that V(r) is continuous 
at r = fcore. This corresponds to the 
situation where the charge of the core is 
an infinitely thin shell. The deep poten- 
tial in the inner region means that the 
wavefunction has a high curvature, so 
small steps must be used in the numeri- 
cal calculation (in this region). The hy- 
pothetical potentials in (b) and (c) are 
useful for testing the numerical method 
and for showing why the eigenenergies 
of any potential proportional to 1/r at 
long range obey a quantum defect for- 
mula (like eqn 4.1). The form of the 
solution depends sensitively on the en- 
ergy in the outer region r > rcore, but 
in the inner region where |F| < |V(r)| 
it does not, e.g. the number of nodes 
(‘wiggles’) in this region changes slowly 
with energy E. Thus, broadly speak- 
ing, the problem reduces to finding the 
wavefunction in the outer region that 
matches boundary conditions, at r = 
Tcore, that are almost independent of 
the energy—the potential energy curve 
shown in (b) is an extreme example 
that gives useful insight into the be- 
haviour of the wavefunction for more 
realistic central fields. 


4.4.1 Self-consistent solutions 


The numerical method described above, or a more sophisticated one, 
can be used to find the wavefunctions and energies for a given potential 
in the central-field approximation. Now we shall think about how to de- 
termine Vor itself. The potential of the central field in eqn 4.2 includes 
the electrostatic repulsion of the electrons. To calculate this mutual 
repulsion we need to know where the electrons are, i.e. their wavefunc- 
tions, but to find the wavefunctions we need to know the potential. This 
argument is circular. However, going round and round this loop can be 
useful in the following sense. As stated above, the method starts by 
making a reasonable estimate of Vcr and then computing the electronic 
wavefunctions for this potential. These wavefunctions are then used to 
calculate a new average potential (using the central-field approximation) 
that is more realistic than the initial guess. This improved potential is 
then used to calculate more accurate wavefunctions, and so on. On suc- 
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cessive iterations, the changes in the potential and wavefunctions should 
get smaller and converge to a self-consistent solution, i.e. where the wave- 
functions give a certain Vor(r), and solving the radial equation for that 
central potential gives back the same wavefunctions (within the required 
precision).’° This self-consistent method was devised by Hartree. How- 
ever, the wavefunctions of multi-electron atoms are not simply products 
of individual wavefunctions as in eqn 4.5. In our treatment of the excited 
configurations of helium we found that the two-electron wavefunctions 
had to be antisymmetric with respect to the permutation of the electron 
labels. This symmetry requirement for identical fermions was met by 
constructing symmetrised wavefunctions that were linear combinations 
of the simple product states (i.e. the spatial part of these functions is 
a and Ge A convenient way to extend this symmetrisation to 
N particles is to write the wavefunction as a Slater determinant: 


Wa (1) Wa (2) coos Va (N) 
(1) pe(2) --- we) 
v= We (1) We (2) ae we (N) 


oy or. Se 


Here a, b,c,...,x are the possible sets of quantum numbers of the in- 
dividual electrons,'® and 1,2,...,.N are the electron labels. The change 


of sign of a determinant on the interchange of two columns makes the 
wavefunction antisymmetric. The Hartree-Fock method uses such sym- 
metrised wavefunctions for self-consistent calculations and nowadays this 
is the standard way of computing wavefunctions, as described in Brans- 
den and Joachain (2003). In practice, numerical methods need to be 
adapted to the particular problem being considered, e.g. numerical val- 
ues of the radial wavefunctions that give accurate energies may not give 
a good value for a quantity such as the expectation value (1/r?) that is 
very sensitive to the behaviour at short range. 


4.5 The spin-orbit interaction: a quantum 
mechanical approach 


The spin-orbit interaction (@s-1 (see eqn 2.49) splits the energy levels to 
give fine structure. For the single valence in an alkali we could treat this 
interaction in exactly the same way as for hydrogen in Chapter 2, i.e. use 
the vector model that treats the angular momenta as vectors obeying 
classical mechanics (supplemented with rules such as the restriction of 
the angular momentum to integer or half-integer values). However, in 
this chapter we shall use a quantum mechanical treatment and regard the 
vector model as a useful physical picture that illustrates the behaviour 
of the quantum mechanical operators. The previous discussion of fine 
structure in terms of the vector model had two steps that require further 
justification. 


'SThe number of iterations required, 
before the changes when going round 
the loop become very small, depends 
on how well the initial potential is cho- 
sen, but the final self-consistent solu- 
tion should not depend on the initial 
choice. In general, it is better to let a 
computer do the work rather than ex- 
pend a lot of effort improving the start- 
ing point. 


16 Including both space and spin. 
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17 The wavefunction for an alkali metal 
atom in the central-field approxima- 
tion is a product of a radial wavefunc- 
tion (which does not have an analyti- 
cal expression) and angular momentum 
eigenfunctions (as in hydrogen). 


18 Mtore explicitly, we have 
|lmpsms) = Yim, spin, where 
Wspin = \ms = +1/2) or |ms = —1/2). 


19proof of these commutation re- 
lations: [Salz + Syly + szlz, lz] = 
8x [lx lz] + Sy [ly lz] = —isaly tisyle F 
0. Similarly, [szlz + Syly + szlz, sz] = 
—isyle + iszly Ff O. Note that 
[s-1,l,] = —[s-1,s,] and hence s -1 
commutes with lz + sz. 


Fig. 4.8 The total angular momen- 
tum of the atom j = 1+ ss is a fixed 
quantity in the absence of an exter- 
nal torque. Thus an interaction be- 
tween the spin and orbital angular mo- 
menta $s -1 causes these vectors to ro- 
tate (precess) around the direction of j 
as shown. 


20 As for helium in Section 3.2 and in 
the classical treatment of the normal 
Zeeman effect in Section 1.8. 


21 These commutation relations for the 
operators correspond to the conserva- 
tion of the total angular momentum, 
and its component along the z-axis. 
Only an external torque on the atom 
affects these quantities. The spin-orbit 
interaction is an internal interaction. 


(a) The possible values of the total angular momentum obtained by 
the addition of the electron’s spin, s = 1/2, and its orbital angular 
momentum are j =1+1/2 or! —1/2. This is a consequence of the 
rules for the addition of angular momentum in quantum mechanics 
(vector addition but with the resultant quantised). 


(b) The vectors have squared magnitudes given by j? = j(j + 1), P = 
I(1 +1) and s* = 3/4, where j and / are the relevant angular mo- 
mentum quantum numbers. 


Step (b) arises from taking the expectation values of the quantum op- 
erators in the Hamiltonian for the spin-orbit interaction. This is not 
straightforward since the atomic wavefunctions R(r) |l mj sms) are not 
eigenstates of this operator!’—this means that we must face the com- 
plications of degenerate perturbation theory. This situation arises fre- 
quently in atomic physics and merits a careful discussion. 

We wish to determine the effect of an interaction of the form s- 1 
on the angular eigenfunctions |lm;sm,). These are eigenstates of the 
operators I”, 1,, s? and s, labelled by the respective eigenvalues.!* There 
are 2(21+ 1) degenerate eigenstates for each value of | because the energy 
does not depend on the orientation of the atom in space, or the direction 
of its spin, i.e. energy is independent of m; and m,;. The states |! m; s mg) 
are not eigenstates of s-1 because this operator does not commute with 
l, and s,: [s-1,l,] #0 and [s-1, s.] 4 0.19 Quantum operators only 
have simultaneous eigenfunctions if they commute. Since |! m;,smzg) is 
an eigenstate of /, it cannot simultaneously be an eigenstate of s-1, and 
similarly for s,. However, s-1 does commute with F? ands”: [s-1, ?] =0 
and {s-1,s?] = 0 (which are easy to prove since sz, Sy, 82, ly, ly and 
L, all commute with s? and I”). So 1 and s are good quantum numbers 
in fine structure. Good quantum numbers correspond to constants of 
motion in classical mechanics—the magnitudes of 1 and s are constant 
but the orientations of these vectors change because of their mutual 
interaction, as shown in Fig. 4.8. If we try to evaluate the expectation 
value using wavefunctions that are not eigenstates of the operator then 
things get complicated. We would find that the wavefunctions are mixed 
by the perturbation, i.e. in the matrix formulation of quantum mechanics 
the matrix representing the spin-orbit interaction in this basis has off- 
diagonal elements. The matrix could be diagonalised by following the 
standard procedure for finding the eigenvalues and eigenvectors,?° but 
a p-electron gives six degenerate states so the direct approach would 
require the diagonalisation of a 6 x6 matrix. It is much better to find the 
eigenfunctions at the outset and work in the appropriate eigenbasis. This 
‘look-before-you-leap’ approach requires some preliminary reasoning. 

We define the operator for the total angular momentum as j =1+s. 
The operator j? commutes with the interaction, as does its component 
je: [s-1,j?] = 0 and [s-1, j.] = 0. Thus j and m, are good quantum 
numbers.”! Hence suitable eigenstates for calculating the expectation 
value of s-1 are |I sj m,;). Mathematically these new eigenfunctions can 
be expressed as combinations of the old basis set: 


isgm,;) = ye C(lsjmj; m,ms) |lmz sms) . 
M1,Ms5 
Each eigenfunction labelled by /, s, 7 and m, is a linear combination of 
the eigenfunctions with the same values of | and s but various values 
of m, and m,. The coefficients C are the Clebsch—Gordan coefficients 
and their values for many possible combinations of angular momenta 
are tabulated in more advanced books. Particular values of Clebsch— 
Gordan coefficients are not needed for the problems in this book but 
it is important to know that, in principle, one set of functions can be 
expressed in terms of another complete set—with the same number of 
eigenfunctions in each basis. 
Finally, we use the identity? j? = 1? + s? + 2s-1 to express the 
expectation value of the spin-orbit interaction as 


II 


4 (Isjm;| he = 1 =e \isjm,;) 


1 £j(j +1) -L(L+ 1) —s(s+D}. 


(lsjm;|s-1|lsjmj;) 


The states |lsjm;) are eigenstates of the operators j*, 1? and s*. The 
importance of the proper quantum treatment may not yet be apparent 
since all we appear to have gained over the vector model is being able to 
write the wavefunctions symbolically as |!sjm,;). We will, however, need 
the proper quantum treatment when we consider further interactions 
that perturb these wavefunctions. 


4.6 Fine structure in the alkalis 


The fine structure in the alkalis is well approximated by an empirical 
modification of eqn 2.56 called the Landé formula: 


Zege 
0) a? 


AES = TS (L4 1) 


heh (4.13) 


In the denominator the effective principal quantum number cubed (n*)° 
(defined in Section 4.2) replaces n°. The effective atomic number Zeg, 
which was defined in the discussion of the central-field approximation, 
tends to the inner atomic number Z, ~ Z as r + 0 (where the electron 
‘sees’ most of the nuclear charge); outside the core the field corresponds 
to an outer atomic number Z, ~ 1 (for neutral atoms). The Landé 
formula can be justified by seeing how the central-field approximation 
modifies the calculation of the fine structure in hydrogen (Section 2.3.2). 
The spin-orbit interaction depends on the electric field that the electron 
moves through; in an alkali metal atom this field is proportional to 
Zeg(r)r/r? rather than r/r? as in hydrogen.?* Thus the expectation 
value of the spin-orbit interaction depends on 


(2) (225) 
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22 This applies both for vector opera- 
tors, where j? = jz? + jy? +jz7, and 
for classical vectors where this is simply 
B= (ja 
J = lil’. 


23-This modification is equivalent to us- 
ing Vor in place of the hydrogenic po- 
tential proportional to 1/r. 
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Frequency 


Fig. 4.9 The fine-structure compo- 
nents of a p to s transition, e.g. the 
3 Py/2-3 Sij2 and 3 P3/2-3 Sis2 transi- 
tions in sodium. (Not to scale.) The 
statistical weights of the upper levels 
lead to a 1:2 intensity ratio. 


24The rates of the allowed transitions 
depend on integrals involving the radial 
wavefunctions (carried out numerically 
for the alkalis) and the integrals over 
the angular part of the wavefunction 
given in Section 2.2.1, where we derived 
the selection rules. 


25This shortened form of the full LS- 
coupling scheme notation gives all the 
necessary information for a single elec- 
tron, Gie 3p ?P3/2—38 *Si/2: 


26This must be true for the physical 
reason that the decay rate is the same 
whatever the spatial orientation of the 
atom, and similarly for the spin states. 
All the different angular states have the 
same radial integral, i.e. that between 
the 3p and 3s radial wavefunctions. 


27 This normal situation for fine struc- 
ture may be modified slightly in a case 
like caesium where the large separation 
of the components means that the w® 
factor in the lifetime (eqn 1.24) leads 
to differences, even though the matrix 
elements are similar. 


rather than (1/r*) as in hydrogen (eqn 2.51). This results in fine struc- 
ture for the alkalis, given by the Landé formula, that scales as Z ?__this 
lies between the dependence on Z* for hydrogenic ions (no screening) 
and the other extreme of no dependence on atomic number for complete 
screening. The effective principal quantum number n* is remarkably 
similar across the alkalis, as noted in Section 4.2. 

As a particular numerical example of the scaling, consider the fine 
structure of sodium (Z = 11) and of caesium (Z = 55). The 3p con- 
figuration of sodium has a fine-structure splitting of 1700m~’, so for 
a Z?-dependence the fine structure of the 6p configuration of caesium 
should be (using n* from Table 4.2) 


55\°. (Jae 
; = —— | = 28.5 scoloe mes 
1.7 x 10 x (2) « (3 Sy xe IU raat 


This estimate gives only half the actual value of 55.4 x 10?m7?, but 
the prediction is much better than if we had used a Z* scaling. (A 
logarithmic plot of the energies of the gross and fine structure against 
atomic number is given in Fig. 5.7. This shows that the actual trend of 
the fine structure lies close to the Z*-dependence predicted.) 

The fine structure causes the familiar yellow line in sodium to be a 
doublet comprised of the two wavelengths A = 589.0nm and 589.6nm. 
This, and other doublets in the emission spectrum of sodium, can be 
resolved by a standard spectrograph. In caesium the transitions be- 
tween the lowest energy configurations (6s—6p) give spectral lines at 
A = 852nm and 894nm—this ‘fine structure’ is not very fine. 


4.6.1 Relative intensities of fine-structure 
transitions 


The transitions between the fine-structure levels of the alkalis obey the 
same selection rules as in hydrogen since the angular momentum func- 
tions are the same in both cases. It takes a considerable amount of cal- 
culation to find absolute values of the transition rates** but we can find 
the relative intensities of the transitions between different fine-structure 
levels from a simple physical argument. As an example we shall look 
at p to s transitions in sodium, as shown in Fig. 4.9. The 3 Pj /2-3$1/2 
transition has half the intensity of the 3P3/2-3S;/2 transition.2° This 
1:2 intensity ratio arises because the strength of each component is pro- 
portional to the statistical weight of the levels (27 + 1). This gives 2:4 
for 7 = 1/2 and 3/2. To explain this we first consider the situation 
without fine structure. For the 3p configuration the wavefunctions have 
the form R3p(r) |lmjsm,) and the decay rate of these states (to 3s) is 
independent of the values of m; and m,.2® Linear combinations of the 
states R(r) |lmism,) with different values of m; and m, (but the same 
values of n, 1 and s, and hence the same lifetime) make up the eigen- 
states of the fine structure, |/sjm,;). Therefore an alkali atom has the 
same lifetime for both values of 7.27 


If each state has the same excitation rate, as in a gas discharge lamp 
for example, then all the states will have equal populations and the 
intensity of a given component of the line is proportional to the number 
of contributing m, states. Similarly, the fine structure of transitions 
from s to p configurations, e.g. 5S, /2-3 P3/2 and 5S; /2 3Pi/2, have an 
intensity ratio of 2:1—in this case the lower frequency component has 
twice the intensity of the higher component, i.e. the opposite of the p 
to s transition shown in Fig. 4.9 (and such information can be used to 
identify the lines in an observed spectrum). More generally, there is a 
sum rule for intensities: the sum of the intensities to, or from, a given 
level is proportional to its degeneracy; this can be used when both upper 
and lower configurations have fine structure (see Exercise 4.8). 

The discussion of the fine structure has shown that spin leads to a 
splitting of energy levels of a given n, of which / levels have different 
j. These fine-structure levels are degenerate with respect to m;, but an 
external magnetic field removes this degeneracy. The calculation of the 
effect of an external magnetic field in Chapter 1 was a classical treat- 
ment that led to the normal Zeeman effect. This does not accurately 
describe what happens for atoms with one valence electron because the 
contribution of the spin magnetic moment leads to an anomalous Zee- 
man effect. The splitting of the fine-structure level into 27 + 1 states 
(or Zeeman sub-levels) in an applied field is shown in Fig. 4.10. It is 
straightforward to calculate the Zeeman energy for an atom with a single 
valence electron, as shown in quantum texts, but to avoid repetition the 
standard treatment is not given here; in the next chapter we shall derive 
a general formula for the Zeeman effect on atoms with any number of 
valence electrons that covers the single-electron case (see Exercise 5.13). 
We also look at the Zeeman effect on hyperfine structure in Chapter 6. 


Further reading 


This chapter has concentrated on the alkalis and mentioned the neigh- 
bouring inert gases; a more general discussion of the periodic table is 
given in Physical chemistry by Atkins (1994). 

The self-consistent calculations of atomic wavefunctions are discussed 
in Hartree (1957), Slater (1960), Cowan (1981), in addition to the text- 
book by Bransden and Joachain (2003). 

The numerical solution of the Schrédinger equation for the bound 
states of a central field in Exercise 4.10 is discussed in French and Taylor 
(1978), Eisberg and Resnick (1985) and Rioux (1991). Such numerical 
methods can also be applied to particles with positive energies in the 
potential to model scattering in quantum mechanics, as described in 
Greenhow (1990). The numerical method described in this book has 
deliberately beer kept simple to allow quick implementation, but the 
Numerov method is more precise for this type of problem. 
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Fig. 4.10 In an applied magnetic field 
of magnitude B the four states of differ- 
ent mj of the 7P3/2 level have energies 
of EZeeman = 9j4BBm;—the factor g; 
arises from the projection of the contri- 
butions to the magnetic moment from 
1 and s onto j (see Exercise 5.13). 
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Exercises 


(4.1) 


(4.2) 


(4.4) 


Configuration of the electrons in francium 
Write down the full electronic configuration of 
francium (atomic number Z = 87). This element 
comes below caesium in the periodic table. 


Finding the series limit for sodium 

Eight ultraviolet absorption lines in sodium have 

wavenumbers of 
38 541, 
39 795 , 


40 383, 
40 706 , 


39.299 , 
40 137, 


40 566 , 
40 814, 


in units of cm‘. Devise an extrapolation proce- 
dure to find the ionization limit of sodium with a 
precision justified by the data. Convert the result 
into electron volts. (You may find a spreadsheet 
program useful for manipulating the numbers.) 
What is the effective principal quantum number 
n” of the valence electron in the ground configu- 
ration? 


Quantum defects of sodium 

The binding energies of the 3s, 4s, 5s and 6s con- 
figurations in sodium are 5.14eV, 1.92eV, 1.01 eV 
and 0.63 eV, respectively. Calculate the quantum 
defects for these configurations and comment on 
what you find. 

Estimate the binding energy of the 8s configura- 
tion and make a comparison with the n = 8 shell 
in hydrogen. 

Quantum defect 

Estimate the wavelength of laser radiation that ex- 
cites the 5s *S1/2-78 *S1/2 transition in rubidium 
by simultaneous absorption of two photons with 
the same frequency (IE(Rb) = 4.17eV). (Two- 
photon spectroscopy is described in Section 8.4 but 
specific details are not required here.) 


Application of quantum defects to helium and 
heltum-lke tons 


Configuration Binding energy (cm~*) 
1s2s 35 250 
1s2p 28 206 
Is3s 14 266 
1s3p 12 430 
1s3d. 12214 


(4.6) 


(4.7) 


(4.8) 


(a) Calculate the wavelength of the 1s3d—1s2p line 
in helium and compare it with the Balmer-a 
line in hydrogen. 

Calculate the quantum defects for the config- 
urations of helium in the table. Estimate the 
binding energies of the 1s4l configurations. 


(b) 


The levels belonging to the 1s”4f configuration 
of the Lit ion all lie at an energy of 72.24eV 
above the ion’s ground state. Estimate the 
second ionization energy of this ion. Answer: 
75.64 eV. 


(c) 


Quantum defects and fine structure of potassium 

An atomic vapour of potassium absorbs light at 
the wavelengths (in nm): 769.9, 766.5, 404.7, 
404.4, 344.7 and 344.6. These correspond to the 
transitions from the ground configuration 4s. Ex- 
plain these observations as fully as you can and 
estimate the mean wavelength of the next doublet 
in the series, and its splitting. (Potassium has 
IE =434eV,)"° 


The Z-scaling of fine structure 

Calculate the fine-structure splitting of the 3p con- 
figuration of the hydrogen-like ion Nat?® (in eV). 
Explain why it is larger than the fine structure 
of the same configuration in the neutral sodium 
(0.11 eV) and hydrogen (1.1 x 107° eV). 


Relative intensities of fine-structure components 


(a) An emission line in the spectrum of an al- 
kali has three fine-structure components cor- 
responding to the transitions *D32— P32, 
*Ds5/2— P32 and *D3/o-P 1/2. These compo- 
nents have intensities a,b and c, respectively, 
that are in the ratio 1: 9:5. Show that these 
satisfy the rule that the sum of the intensities 
of the transitions to, or from, a given level is 
proportional to its statistical weight (2 +1). 


Sketch an energy-level diagram of the fine- 
structure levels of the two terms n’f ?F and 
nd °D (for n’ > n). Mark the three allowed 
electric dipole transitions and find their rela- 
tive intensities. 


*8For a discussion of how to determine the quantum defect for a series of lines by an iterative method see Softley (1994). 


(4.9) 


(4.10) 


Spherical symmetry of a full sub-shell 

The sum ya [Yim|° is spherically symmetric. 
Show this for the specific case of 1 = 1 and com- 
ment on the relevance of the general expression, 
that is true for all values of 1, to the central-field 
approximation. 


Numerical solution of the Schrodinger equation 
This exercise goes through a method of finding the 
wavefunctions and their energies for a potential (in 
the central-field approximation). This shows how 
numerical calculations are carried out in a simple 
case that can be implemented easily on a computer 
with readily available spreadsheet programs.”? Of 
course, the properties of hydrogen-like atoms are 
well known and so the first stage really serves 
as a way of testing the numerical method (and 
checking that the formulae have been typed cor- 
rectly). It is straightforward to extend the nu- 
merical method to deal with other cases, e.g. the 
potentials in the central-field approximation illus- 
trated in Fig. 4.7.°° 


(a) Derivation of the equations 
Show from eqn 2.4, and other equations in 
Chapter 2, that 


2 
ati S+(E-F (@)) R(e) =0, (4.14) 
where the position and energy have been 
turned into dimensionless variables: + = r/ao 
and F& is the energy in units of e?/87€0a9 = 
13.6eV (equal to half the atomic unit of en- 
ergy used in some of the references). In 
these units the effective potential is 


i(l+1) 2 
x? xr’ 


V (x) = (4.15) 
where | is the orbital angular momentum 
quantum number. 

The derivatives of a function f (x) can be ap- 
proximated by 


df _ f(x +6/2) + f(e-6/2) 


dr fe) 
Gi free) ae — 0) — 27) 
dz2 62 : 


(b 


SS 
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where 6 is a small step size.*” 

Show that the second derivative follows by ap- 
plying the procedure used to obtain the first 
derivative twice. Show also that substitution 
into eqn 4.14 gives the following expression for 
the value of the function at x +0 in terms of 
its value at the two previous points: 


R(x +6) = {2R(x) ns (Vie) = E) R(x)6? 


“(tS ae-o}/ (oe) 


(4.16) 
If we start the calculation near the origin then 
R (26) = 5 (2+ (V () - B) 9 *} R(6), 
R(36) = 5 (2R (28) + (¥ (28) - B) R (26) &? 
+ R(5)}, 


etc. Note that in the first equation the value of 
R(x) at x = 26 depends only on R(d)—it can 
easily be seen why by inspection of eqn 4.16 
for the case of x = 6 (for this value of x the 
coefficient of R(0) is zero). Thus the calcula- 
tion starts at x = 6 and works outwards from 
there.*? At all other positions (x > 6) the 
value of the function depends on its values at 
the two preceding points. From these recur- 
sion relations we can calculate the function at 
all subsequent points. 

The calculated functions will not be nor- 
malised and the starting conditions can be 
multiplied by an arbitrary constant without 
affecting the eigenenergies, as will become 
clear from looking at the results. In the fol- 
lowing R(d) = 1 is the suggested choice but 
any starting value works. 


Implementation of the numerical method 
using a spreadsheet program 
Follow these instructions. 


1. Type the given text labels into cells Al, B1, 
C1, D2, E2 and F2 and the three numbers 
into cells D1, El and F1 so that it has the 
following form: 


29With a spreadsheet it is very easy to make changes, e.g. to find out how different potentials affect the eigenenergies and 
wavefunctions. 

30Tt is intended to put more details on the web site associated with this book, see introduction for the address. 

31The electron mass Me = 1 in these units. Or, more strictly, its reduced mass. 

32'This abbreviation for 6r should not be confused with the quantum defect. 

33This example is an exception to the general requirement that the solution of a second-order differential equation, such as 
that for a harmonic oscillator, requires a knowledge of the function at two points to define both the value of the function and 
its derivative. 
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1 
2 


A 


x 


B CD E F 
V(x) psi 0.02 —0.25 1 


step energy ang.mom. 


Column A will contain the z-coordinates, 
the potential will be in column B and the 
function in column C. Cells D1, El and Fl 
contain the step size, energy and orbital an- 
gular momentum quantum number (/ = 1), 
respectively. 


. Put 0 into A2 and the formula =A2+$D$1 


into A3. Copy cell A3 to the block 
A4:A1002. (Or start with a smaller num- 
ber of steps and adjust D1 accordingly.) 


. The potential diverges at + = 0 so type 


inf. into B2 (or leave it blank, remember- 
ing not to refer to it). 
Put the formula 


=-2/A3 +$F$1* ($F$1+1) /(A3*A3) 


into cell B3 (as in eqn 4.15). Copy B3 into 
the block B4:B1002. 


. This is the crucial stage that calculates 


the function. Type the number 1 into cell 
C3. (We leave C2 blank since, as explained 
above, the value of the function at x = 0 
does not affect the solution given by the re- 
cursion relation in eqn 4.16.) Now move to 
cell C4 and enter the following formula for 
the recursion relation: 


=( 2*C3+ (B3-$E$1) #C3*$D$1*$D$1 
- (1-$D$1/A3)*C2 )/ (1+$D$1/A3) . 


Copy this into the block C5:C1002. Create 
an xy-plot of the wavefunction (with data 
points connected by smooth lines and no 
markers); the « series is A2:A1002 and the 
y series is C2:C1002. Insert this graph on 
the sheet. 


Now play around with the parameters and 
observe the effect on the wavefunction for 
a particular energy. 


(i) Show that the initial value of the func- 
tion does not affect its shape, or the 
eigenenergy, by putting 0.1 (or any 
number) into cell C3. 


(ii) Change the energy, e.g. put -0.251 
into cell El, then -0.249, and ob- 
serve the change in behaviour at large 


x. (The divergence is exponential, so 
even a small energy discrepancy gives 
a large effect.) Try the different ener- 
gies again with bigger and smaller step 
sizes in D1. It is important to search 
for the eigenenergy using an appropri- 
ate range of x. The eigenenergy lies 
between the two values of the trial en- 
ergy that give opposite divergence, i.e. 
upwards and downwards on the graph. 


(iii) Change F1 to 0 and find a solution for 
= (0). 


. Produce a set of graphs labelled clearly 


with the trial energy that illustrate the 
principles of the numerical solution, for the 
two functions with n = 2 and two other 
cases. Compare the eigenenergies with the 
Bohr formula. 

Calculate the effective principal quantum 
number for each of the solutions, e.g. by 
putting =SQRT(-1/E1) in Gl (and the label 
n* in G2). 

(The search for eigenenergies can be auto- 
mated by exploiting the spreadsheet’s abil- 
ity to optimise parameters subject to con- 
straints (e.g. the ‘Goal Seek’ command, or 
similar). Ask the program to make the last 
value of the function (in cell C1002) have 
the value of zero by adjusting the energy 
(cell E1). This procedure can be recorded 
as a macro that searches for the eigenener- 
gies with a single button click.) 


. Implement one, or more, of the following 


suggestions for improving the basic method 
described above. 


(i) Find the eigenenergies for a potential 
that tends to the Coulomb potential 
(—2/a in dimensionless units) at long 
range, like those shown in Fig. 4.7, 
and show that the quantum defects for 
that potential depend on | but only 
weakly on n. 


(ii) For the potential shown in Fig. 4.7(c) 
compare the wavefunction in the inner 
and outer regions for several different 
energies. Give a qualitative explana- 
tion of the observed behaviour. 


(ili) Calculate the function P(r) = rR(r) 


by putting A3*C3 in cell D3 and copy- 
ing this to the rest of the column. 


(iv) 


Make a plot of P(r), R(r) and V(r) for 
at least two different values of n and 
1. Adjust the value in C3, as in stage 
5(i), to scale the functions to conve- 
nient values for plotting on the same 
axes as the potential. 


Attempt a semi-quantitative calcula- 
tion of the quantum defects in the 
lithium atom, e.g. model Vcr(r) as in 
Fig. 4.7(a) for some reasonable choice 
Ol Yeas 


Numerically calculate the sum of 
r’?R*(r)6 for all the values of the 
function and divide through by its 
square root to normalise the wave- 
function. With normalised functions 
(stored in a column of the spread- 


(vi) 
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sheet) you can calculate the electric 
dipole matrix elements (and their ra- 
tios), e.g. |(3p| 7 |2s)|” / |(3p| r |1s)|? = 
36, as in Exercise 7.6 (not forgetting 
the w® factor from eqn 7.23). 


Assess the accuracy of this numerical 
method by calculating some eigenen- 
ergies using different step sizes. (More 
sophisticated methods of numerical in- 
tegration provided in mathematical 
software packages can be compared to 
the simple method, if desired, but the 
emphasis here is on the atomic physics 
rather than the computation. Note 
that methods that calculate higher 
derivatives of the function cannot cope 
with discontinuities in the potential.) 


34This simple model corresponds to all the inner electron charge being concentrated on a spherical shell. Making the tran- 
sition from the inner to outer regions smoother does not make much difference to the qualitative behaviour, as you can check 


with the program. 
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1Choosing S(r) to account for all 
the repulsion between the spherically- 
symmetric core and the electrons out- 
side the closed shells, and also within 
the core, leaves the repulsion between 
the two valence electrons, i.e. Hre ~ 
e?/4n€0ri2- This approximation high- 
lights the similarity with helium (al- 
though the expectation value is eval- 
uated with different wavefunctions). 
Although it simplifies the equations 
nicely, this is not the best approxi- 
mation for accurate calculations—S(r) 
can be chosen to include most of the 
direct integral (cf. Section 3.3.2). For 
alkali metal atoms, which we studied in 
the last chapter, the repulsion between 
electrons gives a spherically-symmetric 
potential, so that Hye = 0. 


For two p-electrons we cannot ignore 
my, as we did in the treatment of Isnl 
configurations in helium. Configura- 
tions with one, or more, s-electrons can 
be treated in the way already described 
for helium but with the radial wave- 
functions calculated numerically. 


The LS-coupling scheme 


In this chapter we shall look at atoms with two valence electrons, e.g. al- 
kaline earth metals such as Mg and Ca. The structures of these elements 
have many similarities with helium, and we shall also use the central- 
field approximation that was introduced for the alkalis in the previous 
chapter. We start with the Hamiltonian for N electrons in eqn 4.2 and 
insert the expression for the central potential Vcr (r) (eqn 4.3) to give 


he Ne/4 
Se a ee 


ij 


S(ri) 


= j>t 


This Hamiltonian can be written as H = Hor + Aye, where the central- 
field Hamiltonian Hop is that defined in eqn 4.4 and 


a 2 TE 
Bn = 4 8 — 5 


v= E> Mig 


(5.1) 


is the residual electrostatic interaction. This represents that part of the re- 
pulsion not taken into account by the central field. One might think that 
the field left over is somehow non-central. This is not necessarily true. 
For configurations such as 1s2s in He, or 3s4s in Mg, both electrons have 
spherically-symmetric distributions but a central field cannot completely 
account for the repulsion between them—a potential Vcr(r) does not in- 
clude the effect of the correlation of the electrons’ positions that leads 
to the exchange integral.’ The residual electrostatic interaction perturbs 
the electronic configurations nl; n2l2 that are the eigenstates of the cen- 
tral field. These angular momentum eigenstates for the two electrons are 
products of their orbital and spin functians |l,mj,$1™ms,) |lemi,s2M5,) 
and their energy does not depend on the atom’s orientation so that all 
the different m; states are degenerate, e.g. the configuration 3p4p has 
(21, + 1) (2lg + 1) = 9 degenerate combinations of Yj, .m,Yiz,m>-7 Each 
of these spatial states has four spin functions associated with it, but 
we do not need to consider thirty-six degenerate states since the prob- 
lem separates into spatial and spin parts, as in helium. Nevertheless, 
the direct approach would require diagonalising matrices of larger di- 
mensions than the simple 2 x 2 matrix whose determinant was given in 
eqn 3.17. Therefore, instead of that brute-force approach, we use the 
‘look-before-you-leap’ method that starts by finding the eigenstates of 
the perturbation H;e. In that representation, Hye is a diagonal matrix 
with the eigenvalues as its diagonal elements. 


The interaction between the electrons, from their electrostatic repul- 
sion, causes their orbital angular momenta to change, i.e. in the vector 
model 1; and ly change direction, but their magnitudes remain constant. 
This internal interaction does not change the total orbital angular mo- 
mentum L = 1, +1., so], and ly move (or precess) around this vector, as 
illustrated in Fig. 5.1. When no external torque acts on the atom, L has 
a fixed orientation in space so its z-component Mz is also a constant of 
the motion (m;, and m), are not good quantum numbers). This classical 
picture of conservation of total angular momentum corresponds to the 
quantum mechanical result that the operators L? and L. both commute 
with H,.:3 


[Eee 0° sand [el 0. (52) 
Since H,- does not depend on spin it must also be true that 
[eee 0 and ([S.27.)=0. (5.3) 


Actually, H;e also commutes with the individual spins s; and sg but 
we chose eigenfunctions of S to antisymmetrise the wavefunctions, as 
in helium—the spin eigenstates for two electrons are wearin and ome for 
S = 0 and 1, respectively.4 The quantum numbers L,M,,S and Ms 
have well-defined values in this Russell-Saunders or LS-coupling scheme. 
Thus the eigenstates of H,. are |LM,SMs). In the LS-coupling scheme 
the energy levels labelled by L and S are called terms (and there is 
degeneracy with respect to M; and Ms). We saw examples of 'I and 
37, terms for the 1sn/ configurations in helium where the LS-coupling 
scheme is a very good approximation. A more complex example is an 
npn’'p configuration, e.g. 3p4p in silicon, that has six terms as follows: 


fie ST = 0 or 2e 
il if 
= 82= 5 = © = orl 
terms: op S, Peep 2S) 3P 2p. 


The direct and exchange integrals that determine the energies of these 
terms are complicated to evaluate (see Woodgate (1980) for details) 
and here we shall simply make some empirical observations based on 
the terms diagrams in Figs 5.2 and 5.3. The (2l; +1) (2l2+1) = 9 
degenerate states of orbital angular momentum become the 1 + 3 + 
5 = 9 states of Mz associated with the S, P and D terms, respectively. 
As in helium, linear combinations of the four degenerate spin states 
lead to triplet and one singlet terms but, unlike helium, triplets do not 
necessarily lie below singlets. Also, the 3p? configuration has fewer terms 
than the 3p4p configuration for equivalent electrons, because of the Pauli 
exclusion principle (see Exercise 5.6). 

In the special case of ground configurations of equivalent electrons the 
spin and orbital angular momentum of the lowest-energy term follow 
some empirical rules, called Hund’s rules: the lowest-energy term has. 
the largest value of S consistent with the Pauli exclusion principle.° If 
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Fig. 5.1 The residual electrostatic in- 
teraction causes 1; and leg to precess 
around their resultant L = 1, + lo. 


3The proof is straightforward for the 
quantum operator: L, = 11, +12, since 
mj, = q always occurs with mj), = —q 
in eqn 3.30. 


4The Hamiltonian H commutes with 
the exchange (or swap) operator Xj; 
that interchanges the labels of the par- 
ticles 2 + 9; thus states that are simul- 
taneously eigenfunctions of both oper- 
ators exist. This is obviously true for 
the Hamiltonian of the helium atom in 
eqn 3.1 (which looks the same if 1 + 2), 
but it also holds for eqn 5.1. In general, 
swapping particles with the same mass 
and charge does not change the Hamil- 
tonian for the electrostatic interactions 
of a system. 


5Two electrons cannot both have the 
same set of quantum numbers. 
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Fig. 5.2 The terms of the 3p4p configu- 
ration in silicon all lie about 6eV above 
the ground state. The residual electro- 
static interaction leads to energy dif- 
ferences of ~ 0.2eV between the terms, 
and the fine-structure splitting is an or- 
der of magnitude smaller, as indicated 
for the P and 3D terms. This structure 
is well described by the LS-coupling 
scheme. 


Fig. 5.3 The energies of terms of the 
3p? configuration of silicon. For equiv- 
alent electrons the Pauli exclusion prin- 
ciple restricts the number of terms— 
there are only three compared to the 
six in Fig. 5.2. The lowest-energy term 
is °P, in accordance with Hund’s rules, 
and this is the ground state of silicon 
atoms. 


®Hund’s rules are so commonly mis- 
applied that it is worth spelling out 
that they only apply to the lowest term 
of the ground configuration for cases 
where there is only one incomplete sub- 
shell. 


"The large total spin has important 
consequences for magnetism (Blundell 
2001). 
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there are several such terms then the one with the largest LD is lowest. 
The lowest term in Fig. 5.3 is consistent with these rules;® the rule says 
nothing about the ordering of the other terms (or about any of the 
terms in Fig. 5.2). Configurations of equivalent electrons are especially 
important since they occur in the ground configuration of elements in 
the periodic table, e.g. for the 3d® configuration in iron, Hund’s rules 
give the lowest term as °D (see Exercise 5.6).” 
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5.1 Fine structure in the LS-coupling 
scheme 


Fine structure arises from the spin-orbit interaction for each of the un- 
paired electrons given by the Hamiltonian 


Aso = 8181-1 + Bese: ly. 


For atoms with two valence electrons H,_, acts as a perturbation on 
the states |LM;,;SMs). In the vector model, this interaction between 
the spin and orbital angular momentum causes L and S to change di- 
rection, so that neither LZ, nor S$, remains constant; but the total elec- 
tronic angular momentum J = L +S, and its z-component J,, are both 
constant because no external torque acts on the atom. We shall now 
evaluate the effect of the perturbation H,_, on a term using the vec- 
tor model. In the vector-model description of the LS-coupling scheme, 
1, and ly precess around L, as shown in Fig. 5.4; the components per- 
pendicular to this fixed direction average to zero (over time) so that 
only the component of these vectors along L needs to be considered, 
e.g. {(l-L) /|L|?}L. The time average I, -L in the vector model 
becomes the expectation value (1, -L) in quantum mechanics; also we 
have to use L(L +1) for the magnitude-squared of the vector. Applying 
the same projection procedure to the spins leads to 


oe (s; -S) : (1, - L) (so - S) . (Ip - L) 
J meee neh see Eee” 
= $rsS-L (5.4) 


The derivation of this equation by the vector model that argues by 
analogy with classical vectors can be fully justified by reference to the 
theory of angular momentum. It can be shown that, in the basis |J M7) 
of the eigenstates of a general angular momentum operator J and its 
component J,, the matrix elements of any vector operator V are pro- 
portional to those of J, ie. (J Mj|V|J Ms) =c(J My|JI|J Mz).® Fig- 
ure 5.5 gives a pictorial representation of why it is only the component of 
V along J that is well defined. We want to apply this result to the case 
where V = |, or ly in the basis of eigenstates |L Mz), and analogously 
for the spins. For (LZ Mz|], |Z Mr) = c(Z M,|L|L M_z) the constant 
cis determined by taking the dot product of both sides with L to give 


Ae (EM,|],-L|LM_z) | 
(LM,|L-L|ZL Mz)’ 
hence 
(L Mz\h|£L Mz) = attic (L M,|L|L Mz). 
L(L +1) 
This is an example of the projection theorem and can also be applied to 
l, and to s; and sz in the basis of eigenstates |S Mg). It is clear that, 
for diagonal matrix elements, these quantum mechanical results give the 
same result of the vector model. 


(5.5) 


Fig. 5.4 In the LS-coupling scheme 
the orbital angular momenta of the two 
electrons couple to give total angular 
momentum L = 1], + ly. In the vec- 
tor model 1, and lg precess around L; 
similarly, sj and s2 precess around S. 
L and S precess around the total an- 
gular momentum J (but more slowly 
than the precession of ], and lp around 
L because the spin-orbit interaction is 
‘weaker’ than the residual electrostatic 
interaction). 


8This is particular case of a more 
general result called the Wigner- 
Eckart theorem which is the corner- 
stone of the theory of angular momen- 
tum. This powerful theorem also ap- 
plies to off-diagonal elements such as 
(J M;|V|JM!,), and to more com- 
plicated operators such as those for 
quadrupole moments. It is used ex- 
tensively in advanced atomic physics— 
see the ‘Further reading’ section in this 
chapter. 


Fig. 5.5 A pictorial representation of 
the project theorem for an atom, where 
J defines the axis of the system. 
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9 Similarly, in the one-electron case we 
found the fine structure without deter- 
mining the eigenstates |lsjm,j) explic- 
itly in terms of the Y;,,, and spin func- 
tions. 


26 B 
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Fig. 5.6 The fine structure of a °P 
term obeys the interval rule. 


10Tn classical mechanics the word ‘cou- 
pling’ is commonly used more loosely, 
e.g. for coupled pendulums, or coupled 
oscillators, the ‘coupling between them’ 
is taken to mean the ‘interaction be- 
tween them’ that leads to their motions 
being coupled. (This coupling may take 
the form of a physical linkage such as a 
rod or spring between the two systems.) 


Hg a ~ Brs and Ere is comparable 
to the exchange integral. 


Equation 5.4 has the same form as the spin-orbit interaction for the 
single-electron case but with capital letters rather than s-1. The constant 
Brg that gives the spin-orbit interaction for each term is related to that 
for the individual electrons (see Exercise 5.2). The energy shift is 


Es—o = Bis (S ' L) : 


To find this energy we need to evaluate the expectation value of the 
operator L-S = (J-J-—L-L—S.-S) /2 for each term 75+1L. Each 
term has (2S + 1)(2L +1) degenerate states. Any linear combination 
of these states is also an eigenstate with the same electrostatic energy 
and we can use this freedom to choose suitable eigenstates and make the 
calculation of the (magnetic) spin-orbit interaction straightforward. We 
shall use the states |L.S.J.M 7); these are linear combinations of the basis 
states |[Mz,SMs) but we do not need to determine their exact form to 
find the eigenenergies.® Evaluation of eqn 5.6 with the states |LS.JM,,) 
gives 


(5.6) 


By. = ES (J(3+1)-L(L+1)-S(8 +0}. (5.7) 
Thus the energy interval between adjacent J levels is 
AEps = £7 — Ej-1 = Orci (5.8) 


This is called the interval rule. For example, a °P term (L = 1 = S) has 
three J levels: ?°+1L, = Po, ?P1, ?P2 (see Fig. 5.6); and the separation 
between J = 2 and J = 1 is twice that between J = 1 and J = 0. The 
existence of an interval rule in the fine structure of a two-electron system 
generally indicates that the L.S-coupling scheme is a good approximation 
(see the ‘Exercises’ in this chapter); however, the converse is not true. 
The LS-coupling scheme gives a very accurate description of the energy 
levels of helium but the fine structure does not exhibit an interval rule 
(see Example 5.2 later in this chapter). 

It is important not to confuse LS-coupling (or Russell-Saunders cou- 
pling) with the interaction between L and S given by 6,5S8-L. In this 
book the word interaction is used for real physical effects described by 
a Hamiltonian and coupling refers to the forming of linear-combination 
wavefunctions that are eigenstates of angular momentum operators, e.g. 
eigenstates of L and S. The LS-coupling scheme breaks down as the 
strength of the interaction Gs S - L increases relative to that of H,-.° 


5.2 The 77-coupling scheme 


To calculate the fine structure in the L.S-coupling scheme we treated the 
spin-orbit interaction as a perturbation on a term, ?5+!L. This is valid 
when Ere > Es—o, which is generally true in light atoms.!! The spin— 
orbit interaction increases with atomic number (eqn 4.13) so that it can 
be similar to Ee for heavy atoms-——see Fig. 5.7. However, it is only in 
cases with particularly small exchange integrals that E,_, exceeds E, 


Tes 
so that the spin-orbit interaction must be considered before the residual 
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electrostatic interaction. When H,_, acts directly on a configuration it 
causes the 1 and s of each individual electron to be coupled together to 
give j,) = 1, +s; and jo = lg + sg; in the vector model this corresponds 
to 1 and s precessing around j independently of the other electrons. 
In this 77-coupling scheme each valence electron acts on its own, as 
in alkali atoms. For an sp configuration the s-electron can only have 
ji = 1/2 and the p-electron has jg = 1/2 or 3/2; so there are two 
levels, denoted by (j1, jo) = (1/2,1/2) and (1/2,3/2). The residual 
electrostatic interaction acts as a perturbation on the 77-coupled levels; 
it causes the angular momenta of the electrons to be coupled to give 
total angular momentum J = ji + jo (as illustrated in Fig. 5.8). Since 
there is no external torque on the atom, M, is also a good quantum 
number. For an sp configuration there are pairs of J levels for each of the 
two original jj-coupled levels, e.g. (71, je); = (1/2,1/2)), (1/2,1/2), 
and (1/2,3/2), , (1/2,3/2).. This doublet structure, shown in Fig. 5.10, 
contrasts with the singlets and triplets in the LS-coupling scheme. 


5.2. The jj-coupling scheme 85 


Fig. 5.7 A plot of typical energies as a 
function of the atomic number Z (on 
logarithmic scales). A characteristic 
energy for the gross structure is taken 
as the energy required to excite an elec- 
tron from the ground state to the first 
excited state—this is less than the ion- 
ization energy but has a similar varia- 
tion with Z. The residual electrostatic 
interaction is the singlet—triplet separa- 
tion of the lowest excited configuration 
in some atoms with two valence elec- 
trons. The fine structure is the split- 
ting of the lowest p configuration. For 
all cases the plotted energies are fairly 
close to the maximum for that type 
of structure in neutral atoms—higher- 
lying configurations have smaller val- 
ues. 
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Fig. 5.8 The 7j-coupling scheme. The 
spin-orbit interaction energy is large 
compared to the Ere. (Cf. Fig. 5.4 for 
the LS-coupling scheme.) 


121n both of these cases we assume an 
isolated configuration, i.e. that the en- 
ergy separation of the different config- 
urations in the central field is greater 
than the perturbation produced by ei- 
ther Es—o or Ere. 


In summary, the conditions for LS- and jj-coupling are as follows:'? 


LS-coupling scheme: Eye > Es_—o, 
4j-coupling scheme: Es_o > Eve. 


5.3 Intermediate coupling: the transition 


between coupling schemes 
In this section we shall look at examples of angular momentum coupling 
schemes in two-electron systems. Figure 5.9 shows energy-level diagrams 
of Mg and Hg and the following example looks at the structure of these 
atoms. 


Example 5.1 


3s3p, Mg 6s6p, Hg 


2.1850 3.76 
2.1870 3.94 
Pa WSU 4.40 
3.5051 5.40 


The table gives the energy levels, in units of 10°m~! measured from 
the ground state, for the 3s3p configuration in magnesium (Z = 12) and 
6s6p in mercury (Z = 80). We shall use these data to identify the levels 
and assign further quantum numbers. 

For an sp configuration we expect 'P and ?P terms. In the case of 
magnesium we see that the spacings between the three lowest levels are 
2000 m~! and 4100 m~!; these are close to the 1 to 2 ratio expected from 
the interval rule for the levels with J = 0,1 and 2 that arise from the 
triplet. The LS-coupling scheme gives an accurate description because 
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Fig. 5.9 The terms of helium, magnesium and mercury are plotted on the same energy scale (with hydrogen on the left for 
comparison). The fine structure of the lighter atoms is too small to be seen on this scale and the LS-coupling scheme gives 
a very accurate description. This scheme gives an approximate description for the low-lying terms of mercury even though it 
has a much larger fine structure, e.g. for the 6s6p configuration the Ere > Es—o but the interval rule is not obeyed because 
the spin-orbit interaction is not very small compared to the residual electrostatic interaction. The 1s? configuration of helium 
is not shown; it has a binding energy of —24.6eV (see Fig. 3.4). The 1s2s and 1s2p configurations of helium lie close to the 
n = 2 shell in hydrogen, and similarly the 1s3/ configurations lie close to the n = 3 shell. In magnesium, the terms of the 
3snf configurations have very similar energies to those in hydrogen, but the differences get larger as | decreases. The energies 
of the terms in mercury have large differences from the hydrogen energy levels. Much can be learnt by carefully studying this 
term diagram, e.g. there is a 1P term which has similar energy in the three configurations: 1s2p, 3s3p and 6s6p in He, Mg and 
Hg, respectively—thus the effective quantum number n* is similar despite the increase in n. Complex terms arise when both 
valence electrons are excited in Mg, e.g. the 3p? configuration, and the 5d°6s*6p configuration in Hg. 


88 The LS-coupling scheme 


13This identification of the levels is 
supported by other information, e.g. 
determination of J from the Zeeman ef- 
fect and the theoretically predicted be- 
haviour of an sp configuration shown in 
Fig. 5.10. 


oe 


the fine structure is much smaller than the energy separation (Ere ~ 
1.3 x 10°m~!) between the °P term at ~ 2.2 x 10°m~! and the 'P, 
level at 3.5 x 10°m7!. In mercury the spacings of the levels, going 
down the table, are 0.18, 0.46 and 1.0 (in units of 10° m~'); these 
levels are not so clearly separated into a singlet and triplet. Taking the 
lowest three levels as 7Po, ?P; and *P2 we see that the interval rule is 
not well obeyed since 0.46/0.18 = 2.6 (not 2).1% This deviation from 
the LS-coupling scheme is hardly surprising since this configuration has 
a spin-orbit interaction only slightly smaller than the singlet-triplet 
separation. However, even for this heavy atom, the L.S-coupling scheme 
gives a closer approximation than the 77-coupling scheme. 
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Fig. 5.10 A theoretical plot of the energy levels that arise from an sp configuration as a function of the strength of the 
spin-orbit interaction parameter 8 (of the p-electron defined in eqn 2.55). For 8 = 0 the two terms, ?P and P, have an energy 
separation equal to twice the exchange integral; this residual electrostatic energy is assumed to be constant and only @ varies in 
the plot. As 8 increases the fine structure of the triplet becomes observable. As 6 increases further the spin-orbit and residual 
electrostatic interactions become comparable and the LS-coupling scheme ceases to be a good approximation: the interval rule 
and (L.S-coupling) selection rules break down (as in mercury, see Fig. 5.9). At large 8 the jj-coupling scheme is appropriate. 
The operator J commutes with Hs» (and Hre); therefore H;_. only mixes levels of the same J, e.g. the two J = 1 levels in 
this case. (The energies of the J = 0 and 2 levels are straight lines because their wavefunctions do not change.) Exercise 5.8 
gives an example of this transition between the two coupling schemes for np(n + 1)s configurations with n = 3 to 5 (that have 


small exchange integrals). 
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Example 5.2 The 1s2p configuration in helium 


dE Gn *) 
2 16908687 
1 16908694 
0 16908793 
1 


17 113 500 


The table gives the values of J and the energy, in units of m~! measured 
from the ground state, for the levels of the 1s2p configuration in helium. 
The ?P term has a fine-structure splitting of about 100m7! that is 
much smaller than the singlet-triplet separation of 10°m~! from the 
electrostatic interaction (twice the exchange integral). Thus the LS- 
coupling scheme gives an excellent description of the helium atom and 
the selection rules in Table 5.1 are well obeyed. But the interval rule is 
not obeyed—the intervals between the J levels are 7m~! and 99 m~! and 
the fine structure is inverted. This occurs in helium because spin-spin 
and spin—other-orbit interactions have an energy comparable with that of 
the spin-orbit interaction.!4 However, for atoms other than helium, the 
rapid increase in the strength of the spin-orbit interaction with Z ensures 
that H,_, dominates over the others. Therefore the fine structure of 
atoms in the LS-coupling scheme usually leads to an interval rule. 


Further examples of energy levels are given in the exercises at the end 
of this chapter. Figure 5.10 shows a theoretical plot of the transition 
from the LS- to the 77-coupling scheme for an sp configuration. Conser- 
vation of the total angular momentum means that J is a good quantum 
number even in the intermediate coupling regime and can always be used 
to label the levels. The notation 2°*+!L, for the LS-coupling scheme is 
often used even for systems in the intermediate regime and also for one- 
electron systems, e.g. 1s 7S, /2 for the ground state of hydrogen. 


Table 5.1 Selection rules for electric dipole (E1) transitions in the LS-coupling 
scheme. Rules 1—4 apply to all electric dipole transitions; rules 5 and 6 are obeyed 
only when L and S are good quantum numbers. The right-hand column gives the 
structure to which the rule applies. 


oe) =O cel (i= Ore 0) Level 
Dee Gy — 0, acl (M; =0“ M; =0if AJ =0) State 
3 Parity changes Configuration 
Ae = bl One electron jump Configuration 
5 SAL = 0s (L=0“L' =0) Term 
6) As = 0 Term 


14The spin-spin interaction arises from 
the interaction between two magnetic 
dipoles (independent of any relative 
motion). See eqn 6.12 and its expla- 
nation. 
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15There is no simple physical ex- 
planation of why an M; = O to 
My; = 0 transition does not occur 
if J = J’; it is related to the sym- 
metry of the dipole matrix element 
(yJ Mj = O|r\y'J My; =0), where y¥ 
and y’ represent the other quantum 
numbers. The particular case of J = 
J’ = 1 and AM, = 0 is discussed in 
Budker et al. (2003). 


16 This line comes from the second level 
in the table given in Example 5.1, since 
O54 m= 1/(3.941 x 10°m-*). 


17tntercombination lines are not ob- 
served in magnesium and helium. The 
relative strength of the intercombina- 
tion lines and allowed transitions are 
tabulated in Kuhn (1969). 


18 fost quantum mechanics texts de- 
scribe the anomalous Zeeman effect for 
a single valence electron that applies to 
the alkalis and hydrogen. 


5.4 Selection rules in the DS-coupling 
scheme 


Table 5.1 gives the selection rules for electric dipole transitions in the 
LS-coupling scheme (listed approximately in the order of their strict- 
ness). The rule for J reflects the conservation of this quantity and is 
strictly obeyed; it incorporates the rule for Aj in eqn 2.59, but with 
the additional restriction J = 0 # J’ = 0 that affects the levels with 
J = 0 that occur in atoms with more than one valence electron. The 
rule for AM, follows from that for AJ: the emission, or absorption, of 
a photon cannot change the component along the z-axis by more than 
the change in the total atomic angular momentum. (This rule is rele- 
vant when the states are resolved, as in the Zeeman effect described in 
the following section.)!° The requirement for an overall change in parity 
and the selection rule for orbital angular momentum were discussed in 
Section 2.2. In a configuration n1l; ngl2 n3l3--+nzl, only one electron 
changes its value of | (and may also change n). The rule for AL al- 
lows transitions such as 3p4s ?P,—3p4p °P;. The selection rule AS = 0 
arises because the electric dipole operator does not act on spin, as noted 
in Chapter 3 on helium; as a consequence, singlets and triplets form 
two unconnected sets of energy levels, as shown in Fig. 3.5. Similarly, 
the singlet and triplet terms of magnesium shown in Fig. 5.9 could be 
rearranged. In the mercury atom, however, transitions with AS = 1 
occur, such as 6s? 'Sp—6s6p ?P,, that gives a so-called intercombination 
line with a wavelength of 254nm.!° This arises because this heavy atom 
is not accurately described by the LS-coupling scheme and the spin— 
orbit interaction mixes some 'P; wavefunction into the wavefunction 
for the term that has been labelled ?P,; (this being its major compo- 
nent). Although not completely forbidden, the rate of this transition is 
considerably less than it would be for a fully-allowed transition at the 
same wavelength; however, the intercombination line from a mercury 
lamp is strong because many of the atoms excited to triplet terms will 
decay back to the ground state via this transition (see Fig. 5.9).1" 


5.5 The Zeeman effect 


The Zeeman effect for atoms with a single valence electron was not 
presented in earlier chapters to avoid repetition and that case is covered 
by the general expression derived here for the L.S-coupling scheme.!® 
The atom’s magnetic moment has orbital and spin contributions (see 
Blundell 2001, Chapter 2): 


= —ppL — goupS. (5.9) 


The interaction of the atom with an external magnetic field is described 
by Hzm = —p-B. The expectation value of this Hamiltonian can be 
calculated in the basis |LS JM ), provided that Ezz « Es_o X Exe, 
i.e. the interaction can be treated as a perturbation to the fine-structure 


levels of the terms in the LS-coupling scheme. In the vector model we 
project the magnetic moment onto J (see Fig. 5.11) following the same 
rules as are used in treating fine structure in the LS-coupling scheme 
(and taking B = Be,). This gives 


(uw - J) 


(L- J) +95 (S- J) 
Gees Ca. 


oan oe 
ag re 1) 


Tens eee (5.10) 


In the vector model the quantities in angled brackets are time averages.!° 
In a quantum description treatment the quantities (---) are expectation 
values of the form (J Mj|---|J M ).?° In the vector model 


Ezr = guppBM,, (Sidi) 


where the Landé g-factor is gy = {(L- J) +9; (S-J)}/{J (J +1}. As- 
suming that g, ~ 2 (see Section 2.3.4) gives 


Semioseel) — Lb 1) 


oe) = os 1) a 


Singlet terms have S = 0 so J = L and gy = 1 (no projection is nec- 
essary). Thus singlets all have the same Zeeman splitting between My 
states and transitions between singlet terms exhibit the normal Zeeman 
effect (shown in Fig. 5.12). The AM, = +1 transitions have frequencies 
shifted by tupB/h with respect to the AM, = 0 transitions. 

In atoms with two valence electrons the transitions between triplet 
terms exhibit the anomalous Zeeman effect. The observed pattern de- 
pends on the values of g; and J for the upper and lower levels, as shown 
in Fig. 5.13. In both the normal and anomalous effects the z-transitions 
(AM, = 0) and o-transitions (AM, = +1) have the same polarizations 
as in the classical model in Section 1.8. Other examples in Exercises 5.10 
to 5.12 show how observation of the Zeeman pattern gives information 
about the angular momentum coupling in the atom. (The Zeeman ef- 
fect observed for the ?P/2~7S1/2 and ?P3/2—*S)/2 transitions that arise 
between the fine-structure components of the alkalis and hydrogen is 
treated in Exercise 5.13.) Exercise 5.14 goes through the Paschen—Back 
effect that occurs in a strong external magnetic field—see Fig. 5.14. 
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4B 


Fig. 5.11 The projection of the 
contributions to the total magnetic 
moment from the orbital motion and 
the spin are projected along J. 


19 Components perpendicular to J 
time-average to zero. 


20 This statement is justified by the pro- 
jection theorem (Section 5.1), derived 
from the more general Wigner—Eckart 
theorem. The theorem shows that the 
expectation value of the vector opera- 
tor L is proportional to that of J in the 
basis of eigenstates |J M7), i.e. 


(J My|L|J Mz) x (J Mz|S|J My), 


and similarly for the expectation value 
of S. The component along the mag- 
netic field is found by taking the dot 
product with B: 
(J M;|L-B|J M7) 
«(J M;|J-B|JM;) 
«(J My| Jz |J My) = My. 
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Fig. 5.12 The normal Zeeman effect 
on the 1ls2p 1P,—1s3d !Do line in he- 
lium. These levels split into three and 
five Mj states, respectively. Both lev- 
els have S = 0 and gy = 1 so that the 
allowed transitions between the states 
give the same pattern of three compo- 
nents as the classical model (in Sec- 
tion 1.8)—this is the historical reason 
why it is called the normal Zeeman ef- 
fect. Spectroscopists called any other 
pattern an anomalous Zeeman effect, 
although such patterns have a straight- 
forward explanation in quantum me- 
chanics and arise whenever S + 0, 
e.g. all atoms with one valence elec- 
tron have S = 1/2. The z- and o- 
components arise from AM, = 0 and 
AM, = +1 transitions, respectively. 
(In this example of the normal Zeeman 
effect each component corresponds to 
three allowed electric dipole transitions 
with the same frequency but they are 
drawn with a slight horizontal separa- 
tion for clarity.) 


Fig. 5.13 The anomalous Zeeman ef- 
fect for the 6s6p *P2-6s7s 3S, transi- 
tion in Hg. The lower and upper levels 
both have the same number of Zeeman 
sub-levels (or M7 states) as the levels in 
Fig. 5.12, but give rise to nine separate 
components because the levels have dif- 
ferent values of g7. (The 6s7s config- 
uration happens to have higher energy 
than 6s6p, as shown in Fig. 5.9, but the 
Zeeman pattern does not depend on the 
relative energy of the levels.) 
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5.6 Summary 


Figure 5.15 shows the different layers of structure in a case where the 
LS-coupling scheme is a good approximation, i.e. the residual electro- 
static interaction dominates the two magnetic interactions (spin-orbit 
and with an external magnetic field). The spin-orbit interaction splits 
terms into different J levels. The Zeeman effect of a weak magnetic 
field splits the levels into states of given M,, that are also referred to as 
Zeeman sub-levels. 
There are various ways in which this simple picture can break down. 


(a) Configuration mixing occurs if the residual electrostatic interaction 
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Fig. 5.14 The Paschen—Back effect oc- 
curs in a strong external magnetic field. 
The spin and orbital angular momen- 
tum precess independently about the 
magnetic field. The states have ener- 
gies given by Epp = wp B(M, +2Ms). 
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Fig. 5.15 The hierarchy of atomic structure for the 3s3p configuration of an alkaline earth metal atom. 
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is not small compared to the energy gap between the configurat- 
ions—this is common in atoms with complex electronic structure. 


The jj-coupling scheme is a better approximation than LS-coupling 
or Russell-Saunders coupling when the spin-orbit interaction is 
greater than the residual electrostatic interaction. 


The Paschen—Back effect arises when the interaction with an exter- 
nal magnetic field is stronger than the spin-orbit interaction (with 
the internal field). This condition is difficult to achieve except for 
atoms with a low atomic number and hence small fine structure. 
Similar physics arises in the study of the Zeeman effect on hyperfine 
structure where the transition between the low-field and high-field 
regimes occurs at values of the magnetic field that are easily acces- 
sible in experiments (see Section 6.3). 


(b) 


Further reading 


The mathematical methods that describe the way in which angular 
momenta couple together form the backbone of the theory of atomic 
structure. In this chapter the quantum mechanical operators have been 
treated by analogy with classical vectors (the vector model) and the 
Wigner—Eckart theorem was mentioned to justify the projection theo- 
rem. Graduate-level texts give a more comprehensive discussion of the 
quantum theory of angular momentum, e.g. Cowan (1981), Brink and 
Satchler (1993) and Sobelman (1996). 
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Exercises 


(5.1) Description of the LS-coupling scheme 
Explain what is meant by the central-field approx- 


of neutral calcium consists of three components 
at relative positions 0, 106 and 158 (in units of 


(5.2) 


imation and show how it leads to the concept of 
electron configurations. Explain how perturba- 
tions arising from (a) the residual electrostatic in- 
teractions, and (b) the magnetic spin-orbit inter- 
actions, modify the structure of an isolated multi- 
electron configuration in the LS-coupling limit. 


Fine structure in the LS-coupling scheme 
Show from eqn 5.4 that the J levels of the °P term 
in the 3s4p configuration have a separation given 
by eqn 5.8 with Brs = Bap/2 (where Gaps -1is the 
spin-orbit interaction of the 4p-electron). 


The LS-coupling scheme and the interval rule in 
calcrum 

Write down the ground configuration of calcium 
(Z = 20). The line at 610nm in the spectrum 


(5.4) 


cm"). Identify the terms and levels involved in 
these transitions. 

The spectrum also contains a multiplet of six lines 
with wavenumbers 5019, 5033, 5055, 5125, 5139 
and 5177 (in units of cm~'). Identify the terms 
and levels involved. Draw a diagram of the rel- 
evant energy levels and the transitions between 
them. What further experiment could be carried 
out to check the assignment of quantum numbers? 


The LS-coupling scheme in zinc 

The ground configuration of zinc is 4s’. The 
seven lowest energy levels of zinc are 0, 32311, 
32 501, 32 890, 46 745, 53672 and 55 789 (in units 
of cm~'). Sketch an energy-level diagram that 
shows these levels with appropriate quantum num- 
bers. What evidence do these levels provide that 


(5.5) 


(5.6) 


the L.S-coupling scheme describes this atom. Show 
the electric dipole transitions that are allowed be- 
tween the levels. 


The LS-coupling scheme 


3s3p, Mg 3s3p, Fe!** 
2.1850 23.386 
2.1870 23.966 
2.1911 25.378 
3.5051 35.193 


The table gives the energy levels, in units of 
10° m~’ measured from the ground state, of the 
3s3p configuration in neutral magnesium (Z = 
12) and the magnesium-like ion Fe’**. Suggest, 
with reasons, further quantum numbers to iden- 
tify these levels. Calculate the ratio of the spin— 
orbit interaction energies in the 3s3p configura- 
tion of Mg and Fe’*t, and explain your result. 
Discuss the occurrence in the solar spectrum of 
a strong line at 41.726nm that originates from 
Fe'*+, Would you expect a corresponding tran- 
sition in neutral Mg? 


LS-coupling for configurations with equivalent 
electrons 


(a) List the values of the magnetic quantum num- 
bers mi,, Ms,, Miz and ms, for the two elec- 
trons in an np” configuration to show that fif- 
teen degenerate states exist within the central- 
field approximation. Write down the values of 
Mr, =m, +m, and Ms = ms, +™Ms, associ- 
ated with each of these states to show that the 
only possible terms in the L.S-coupling scheme 
are 'S, 3P and ‘D. 

The 1s?2s”2p’ configuration of doubly-ionized 
oxygen has levels at relative positions 0, 113, 
307, 20271 and 43184 (in units of cm~*) 
above the ground state, and its spectrum con- 
tains weak emission lines at 19964cm~’ and 
20158cm~'. Identify the quantum numbers 
for each of the levels and discuss the extent to 
which the LS-coupling scheme describes this 
multiplet. 


For six d-electrons, in the same sub-shell, 
write a list of the values of the m; and m,; 
for the individual electrons corresponding to 
Ms = 2 and Mz = 2. Briefly discuss why this 
is the maximum value of Ms, and why Mz, < 2 
for this particular value of Ms. (Hence from 


(c) 


(5.7) 


(5.8) 


(5.9) 
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Hund’s rules the °D term has the lowest en- 
ergy.) Specify the lowest-energy term for each 
of the five configurations nd, nd’, nd?, nd* 
and nd°. 


Transition from LS- to jj-coupling 


3p4s, Si 3p7s, Si 
J Energy (10°m7') J Energy (10°m~*) 
0 3.968 6.154 
1 3.976 1 6.160 
2 3.996 2 6.182 
il 4,099 i 6.188 


The table gives J-values and energies (in units of 
10° m~* measured from the ground state) of the 
levels in the 3p4s and 3p7s configurations of sili- 
con. Suggest further quantum numbers to identify 
the levels. 

Why do the two configurations have nearly the 
same value of Ey=2 — Ey=o but quite different en- 
ergy separations between the two J = 1 states? 


Angular-momentum coupling schemes 


4p5s, germanium 5p6s, tin 
J Energy (10°m~') J Energy (10°m7!) 
0 3.19 0 3.47 
il Stl 1 3.49 
2 3.91 2 3.86 
i 4.00 il 3.93 


The table gives the J-values and energies (in units 
of 10° m~* measured from the ground state) of the 
levels in the configurations 4p5s in Ge and 5p6s in 
Sn. Data for the 3p4s configuration in Si are given 
in the previous exercise. How well does the LS- 
coupling scheme describe the energy levels of the 
np(n + 1)s configurations with n = 3, 4 and 5? 
Give a physical reason for the observed trends in 
the energy levels. 

One of the J = 1 levels in Ge has a Landé g-factor 
of gz = 1.06. Which level would you expect this 
to be and why? 


Selection rules in the LS-coupling scheme 

State the selection rules that determine the config- 
urations, terms and levels that can be connected 
by an electric dipole transition in the L.S-coupling 
approximation. Explain which rules are rigorous, 
and which depend on the validity of the coupling 
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(5.10) 


(5.11) 


(5.12) 


scheme. Give a physical justification for three of 
these rules. 

Which of the following are allowed for electric 
dipole transitions in the LS-coupling scheme: 


(a) 1s3d *D,-1s2s 3Si, 
(b) 1s3d 3D3-1s2p °P1, 
(e) 2p° =Pi-2s2p *Pi, 
(d) 3p” *P2-3p? *Pi, 
(e) 3p°3d 1Do-3p® 1So? 


The transition 4d°5s” ?Ds/2—4d)°5p P32 satis- 
fies the selection rules for L,S and J but it ap- 
pears to involve two electrons jumping at the same 
time. This arises from configuration mixing—the 
residual electrostatic interaction may mix configu- 
rations.*! The commutation relations in eqns 5.2 
and 5.3 imply that H,e only mixes terms of the 
same L,S and J. Suggest a suitable configuration 
that gives rise to a ?P3 /2 level that could mix with 
the 4d'°5p configuration to cause this transition. 


The anomalous Zeeman effect 

What selection rule governs AM, in electric dipole 
transitions? Verify that the °S,—*P2 transition 
leads to the pattern of nine equally-spaced lines 
shown in Fig. 5.13 when viewed perpendicular to 
a weak magnetic field. Find the spacing for a mag- 
netic flux density of 1T. 


The anomalous Zeeman effect 

Draw an energy-level diagram for the states of 3S; 
and °P; levels in a weak magnetic field. Indi- 
cate the allowed electric dipole transitions between 
the Zeeman states. Draw the pattern of lines ob- 
served perpendicular to the field on a frequency 
scale (marked in units of wa B/h). 


The anomalous Zeeman effect 


| 2.| 2a See? | 2aies lee | DerequEncy 


(5.13) 


(5.14) 


The above Zeeman pattern is observed for a spec- 
tral line that originates from one of the levels of a 
3P term in the spectrum of a two-electron system; 
the numbers indicate the relative separations of 
the lines, observed perpendicular to the direction 
of the applied magnetic field. Identify L, S and J 
for the two levels in the transition.” 


The anomalous Zeeman effect in alkalis 
Note that atoms with one valence electron are not 
discussed explicitly in the text. 


(a) Give the value of g, for the one-electron levels 
*S1/2) Piso and ?P3/2. 


Show that the Zeeman pattern for the 
3p ?P3/2-38 23, /2 transition in sodium has 
six equally-spaced lines when viewed perpen- 
dicular to a weak magnetic field. Find the 
spacing (in GHz) for a magnetic flux density 
of 1T. Sketch the Zeeman pattern observed 
along the magnetic field. 


(b) 


Sketch the Zeeman pattern observed per- 
pendicular to a weak magnetic field for the 
3p 7*P, /2-38 45) /2 transition in sodium. 

(d) The two fine-structure components of the 3p- 
3s transition in sodium in parts (b) and (c) 
have wavelengths of 589.6nm and 589.0 nm, 
respectively. What magnetic flux density pro- 
duces a Zeeman splitting comparable with the 
fine structure?" 


The Paschen—Back effect 

In a strong magnetic field L and S precess in- 
dependently about the field direction (as shown 
in Fig. 5.14), so that J and My are not good 
quantum numbers and appropriate eigenstates are 
|LM,SMs). This is called the Paschen—Back ef- 
fect. In this regime the LS-coupling selection rules 
are AM; = 0,+1 and AMs = 0 (because the elec- 
tric dipole operator does not act on the spin).”* 
Show that the Paschen—Back effect leads to a pat- 
tern of three lines with the same spacing as in the 
normal Zeeman effect (i.e. the same as if we com- 
pletely ignore spin).”° 


21In the discussion of the LS-coupling scheme we treated Hye as a perturbation on a configuration and assumed that Eye 
is small compared to the energy separation between the configurations in the central field. This is rarely true for high-lying 
configurations of complex atoms. 


22The relative intensities of the components have not been indicated. 


23This value is greater than 1T so the assumption of a weak field in part (b) is valid. 

?4The rules for J and M,; are not relevant in this regime. 

25The Paschen—Back effect occurs when the valence electrons interact more strongly with the external magnetic field than 
with the orbital field in Hs_.. The LS-coupling scheme still describes this system, i.e. L and S are good quantum numbers. 


Hyperfine structure and 
isotope shift 


6.1 Hyperfine structure 


Up to this point we have regarded the nucleus as an object of charge 
+Ze and mass My, but it has a magnetic moment yz; that is related to 
the nuclear spin I by 

Hy; = gr pNnI. (6.1) 


Comparing this to the electron’s magnetic moment —g, pp s we see that 
there is no minus sign.’ Nuclei have much smaller magnetic moments 
than electrons; the nuclear magneton pn is related to the Bohr magneton 
Lip by the electron-to-proton mass ratio: 

Me 


LB 
ES) = [Bis = ames 


~ ; 2 
M, 1836 2) 


The interaction of 4; with the magnetic flux density created by the 
atomic electrons B, gives the Hamiltonian 


Hyrs = —p,° Be. (6.3) 


This gives rise to hyperfine structure which, as its name suggests, is 
smaller than fine structure. Nevertheless, it is readily observable for 
isotopes that have a nuclear spin (I # 0). 

The magnetic field at the nucleus is largest for s-electrons and we 
shall calculate this case first. For completeness the hyperfine structure 
for electrons with J £ 0 is also briefly discussed, as well as other effects 
that can have similar magnitude. 


(ape 


We have previously considered the atomic electrons as having a charge 
distribution of density —e |z) (r)|*, e.g. in the interpretation of the direct 
integral in helium in eqn 3.15 (see also eqn 6.22). To calculate mag- 
netic interactions we need to consider an s-electron as a distribution of 
magnetisation given by 


Hyperfine structure for s-electrons 


M = ~gsups|v(r)|° . (6.4) 


This corresponds to the total magnetic moment of the electron —gspps 
spread out so that each volume element d°r has a fraction |q (r)|? dr 
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1 Nuclear magnetic moments can be 
parallel, or anti-parallel, to I, i.e. gz 
can have either sign depending on the 
way that the spin and orbital angular 
momenta of the protons and neutrons 
couple together inside the nucleus. The 
proton (that forms the nucleus of a hy- 
drogen atom) has gp > 0 because of 
its positive charge. More generally, nu- 
clear magnetic moments can be pre- 
dicted from the shell model of the nu- 
cleus. 
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Fig. 6.1 (a) An illustration of a nu- 
cleus with spin I surrounded by the 
spherically-symmetric probability dis- 
tribution of an s-electron. (b) That 
part of the s-electron distribution in the 
region r < rp corresponds to a sphere 
of magnetisation M anti-parallel to the 
spin s. 


2See Blundell (2001) or electromag- 
netism texts. 


3This spherical boundary at r = rp 
does not correspond to anything physi- 
cal in the atom but is chosen for math- 
ematical convenience. The radius rp 
should be greater than the radius of the 
nucleus rn and it easy to fulfil the con- 
ditions rn < rp < ao Since typical nu- 
clei have a size of a few fermi (10~1° m), 
which is five orders of magnitude less 
than atomic radii. 


4The magnetisation is a function of 
r only and therefore each shell has a 
uniform magnetisation M(r) between r 
and r+dr. The proof that these shells 
do not produce a magnetic flux den- 
sity at r = 0 does not require M(r) 
to be the same for all the shells, and 
clearly this is not the case. Alterna- 
tively, this result can be obtained by in- 
tegrating the contributions to the field 
at the origin from the magnetic mo- 
ments M(r) d°r over all angles (@ and 


?). 


(a) 


"  ® 


of the total. For s-electrons this distribution is spherically symmetric 
and surrounds the nucleus, as illustrated in Fig. 6.1. To calculate the 
field at r = 0 we shall use the result from classical electromagnetism? 
that inside a uniformly magnetised sphere the magnetic flux density is 


2 
Be = 3 ho M . 
However, we must be careful when applying this result since the distri- 
bution in eqn 6.4 is not uniform—it is a function of r. We consider the 
spherical distribution in two parts. 


(6.5) 


(a) A sphere of radius r = rp, where ry < ao so that the electronic 
wavefunction squared has a constant value of | (0)|? throughout 
this inner region, as indicated on Fig. 6.2.2 From eqn 6.5 the field 
inside this uniformly magnetised sphere is 

Be = —F Ho 9sLB |Vns (0)|? s. (6.6) 

(b) The part of the distribution outside the sphere r > ry produces no 
field at r = 0, as shown by the following argument. Equation 6.5 
for the field inside a sphere does not depend on the radius of that 
sphere—it gives the same field for a sphere of radius r and a sphere of 
radius r+dr. Therefore the contribution from each shell of thickness 
dr is zero. The region r > rp can be considered as being made up of 
many such shells that give no additional contribution to the field.* 


Putting this field and yz; from eqn 6.1 into eqn 6.3 gives 


2 
Hurs = gin 1+ 10 gett |Pns (0)? s= AI -s. 


; (6.7) 


Th {iP 


Fig. 6.2 (a) The probability density | (r)|? of an s-electron at small distances (r < 
ag) is almost constant. The distribution of nuclear matter py(r) gives an indication 
of the nuclear radius ry. To calculate the interaction of the nuclear magnetic moment 
with an s-electron the region is divided into two parts by a boundary surface of radius 
r = Tp > TN (as also shown in Fig. 6.1). The inner region corresponds to a sphere 
of uniform magnetisation that produces a flux density Be at r = 0. The nuclear 
magnetic moment interacts with this field. 


This is called the Fermi contact interaction since it depends on |1ns (0)|” 
being finite. It can also be expressed as 


Hype = AT J (6.8) 


because J = s for] = 0. It is useful to write down this more general 
form at an early stage since it turns out that an interaction proportional 
to I- J is also obtained when / # 0. 

We have already considered the effect of an interaction proportional 
to a dot product of two angular momenta when looking at the spin-orbit 
interaction GS -L (eqn 5.4). In the same way the hyperfine interaction 
in eqn 6.8 causes I and J to change direction but the total angular 
momentum of the atom F = I+ J remains constant. The quantities 
I-B and J -B are not constant in this precession of I and J around F. 
Therefore M; and M, are not good quantum numbers; we use Ff’ and 
Mr instead and the eigenstates of Hyrs are |I JF Mp).° The expectation 
value of eqn 6.8 gives 

Hues = A(-S) = 4 {F(R +1)- IE +1) - JJ +0}. (6.9) 
Example 6.1 Hyperfine structure of the 1s *Si/2 ground state of hy- 
drogen 
The lowest level of the hydrogen atom is 1s *Sj/2, ie. J = 5 and the 
proton has spin J = 1/2 so that F = 0 and 1 and these hyperfine levels 
have energies 


A/4 for = 1), 


Eyes = 4 {F(F +1) -1(I+1)- JJ +1} = om for F = 0. 


The splitting between the hyperfine levels is AEyrs = A (see Fig. 6.3). 
Substituting for {uns (0)|? from eqn 2.22 gives 


Ze 
3) pal G 
Tag nr 


2 
A = 3 Ho 9sB 91UN (6.10) 
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5 This should be compared with the LS- 
coupling scheme where combinations 
of the eigenstates |LM,SMzs) form 
eigenstates of the spin orbit interaction 
|LSJM,). The same warning issued for 
LS-coupling and the spin-orbit inter- 
action 8S-L also applies. It is impor- 
tant not to confuse J J-coupling and the 
interaction AI- J. In the IJ-coupling 
scheme J and J are good quantum num- 
bers and the states are |[JM;M,) or 
|[JF Mr). The latter are eigenstates 
of the interaction AI- J. 


Fig. 6.3 The splitting between the hy- 
perfine levels in the ground state of hy- 
drogen. 
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6 This hyperfine transition in hydrogen 
is the one detected in radio astronomy, 
where it is commonly referred to by its 
wavelength as the 21cm line. 


’This works on the same basic princi- 
ple as the Stern -Gerlach experiment in 
Section 6.4. 


Fig. 6.4 The hydrogen maser. ‘The 
principle of operation is described in 
the text. A magnetic shield excludes 
external fields and the solenoid cre- 
ates a small stable magnetic field. In 
this way the frequency shift produced 
by the Zeeman effect on the hyperfine 
structure is controlled (in the same way 
as in the atomic clock described in Sec- 
tion 6.4.2). 


The proton has a g-factor of gy = 5.6, so for n = 1= Z we have 


SEES ere. 


This is very close to the measured value of 1 420 405 751.7667 +0.0009 Hz. 
This frequency has been measured to such extremely high precision be- 
cause it forms the basis of the hydrogen maser, described below.® 


6.1.2 Hydrogen maser 


Maser stands for microwave amplification by stimulated emission of radi- 
ation and such devices were the precursors of lasers, although nowadays 
they are less widespread than the devices using light. Actually, lasers are 
usually operated as oscillators rather than amplifiers but the acronym 
with ‘o’ instead of ‘a’ is not so good. A schematic of a hydrogen maser 
is shown in Fig. 6.4; it operates in the following way. 


e Molecular hydrogen is dissociated in an electrical discharge. 


e Atoms effuse from the source to form a beam in an evacuated chamber. 


e The atoms pass through a region with a strong magnetic field gradient 
(from a hexapole magnet) that focuses atoms in the upper hyperfine 
level (F' = 1) into a glass bulb. The atomic beam contains atoms in 
both hyperfine levels but the state selection by the magnet” leads to a 
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population inversion in the bulb, i.e. the population in F = 1 exceeds 
that in # = 0; this gives more stimulated emission than absorption 
and hence gain, or amplification of radiation at the frequency of the 
transition. 


e The atoms bounce around inside the bulb—the walls have a ‘non-stick’ 
coating of teflon so that collisions do not change the hyperfine level. 


e The surrounding microwave cavity is tuned to the 1.42 GHz hyperfine 
frequency and maser action occurs when there are a sufficient number 
of atoms in the upper level. Power builds up in a microwave cavity, 
some of which is coupled out through a hole in the wall of the cavity. 


The maser frequency is very stable—much better than any quartz 
crystal used in watches. However, the output frequency is not precisely 
equal to the hyperfine frequency of the hydrogen atoms because of the 
collisions with the walls (see Section 6.4). 


6.1.3. Hyperfine structure for 1 4 0 
Electrons with | 4 0, orbiting around the nucleus, give a magnetic field 


o {-ev x (-r —3 -T)r 
B, = - {ai joe eres ea) a (6.11) 
where —r gives the position of the nucleus with respect to the orbiting 
electron. The first term arises from the orbital motion.® It contains 
the cross-product of —ev with —r, the position vector of the nucleus 
relative to the electron, and —er x v = —2ypl. The second term is just 
the magnetic field produced by the spin dipole moment of the electron 
[t. = —2pgs (taking g, = 2) at a position —r with respect to the dipole.® 
Thus we can write 


(6.12) 


This combination of orbital and spin—dipolar fields has a complicated 
vector form.!? However, we can again use the argument (in Section 5.1) 
that in the vector model there is rapid precession around J, and any 
components perpendicular to this quantisation axis average to zero, so 
that only components along J have a non-zero time-averaged value." 
The projection factor can be evaluated exactly (Woodgate 1980) but we 
shall assume that it is approximately unity giving 
Ho / HB 
Be ae ( re ) : 

Thus from eqns 6.1 and 6.3 we find that the hyperfine interaction for 
electrons with 1 4 0 has the same form AI-J as eqn 6.8. This form of 
interaction leads to the following interval rule for hyperfine structure: 


(6.14) 


(6.13) 


Ep —-Ep1=AF. 


This interval rule is derived in the same way as eqn 5.8 for fine structure 
but with J, J and F instead of L, S and J, as shown in Exercise 6.5. 
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8 This resembles the Biot—Savart law of 
electromagnetism: 


_ po Idsxr 

- Ar v3 
Roughly speaking, the displacement 
along the direction of the current is re- 
lated to the electron’s velocity by ds = 
v dt, where dt is a small increment of 
time, and the current is related to the 
charge by [dt = Q. 
The spin-orbit interaction can be very 
crudely ‘justified’ in a similar way, by 
saying that the electron ‘sees’ the nu- 
cleus of charge +Ze moving round it; 
for a hydrogenic system this simplistic 
argument gives 


Ho HB 

———], 

4n 73 

The Thomas precession factor does not 
occur in hyperfine structure because 
the frame of reference is not rotating. 


°See Blundell (2001). 


10The same two contributions to the 
field also occur in the fine structure of 
helium; the field produced at the po- 
sition of one electron by the orbital 
motion of the other electron is called 
the spin-other-orbit interaction, and a 
spin-spin interaction arises from the 
field produced by the magnetic dipole 
of one electron at the other electron. 


Borbital = —2Z 


Tn, quantum mechanics this corre- 
sponds to saying that the matrix 
elements of any vector operator in 
the eigenbasis |J My) are propor- 
tional to J, i.e. (J Mj|Be|J Mz) x 
(J Mj;|JI|J M7). This is a consequence 
of the Wigner—Eckart theorem that was 
mentioned in Section 5.1. 
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Table 6.1 Comparison of fine and hyperfine structures. 


Hyperfine structure in the 
I J-coupling scheme 


Fine structure in the 
LS-coupling scheme 


Interaction 
Total angular momentum 
Eigenstates 


Energy, £ 


Interval rule 


8 ¢7(J+1)-L(L+1)-S(S4+1)} 


BL-S Al-J 
J=L+Ss F=1+J 
|LSJM,) [JF Mr) 


A{F(F+1)-I(1+1)-J(J +1} 


Er — Erp_-1 = AF 
(if A> AE Quadrupole) 


Ey — Ej-1 = BJ 
(if Bem5) Ss Exe) 


12-This interval rule for magnetic dipole 
hyperfine structure can be disrupted by 
the quadrupole interaction. Some nu- 
clei are not spherical and their charge 
distribution has a quadrupole moment 
that interacts with the gradient of the 
electric field at the nucleus. This elec- 
tric quadrupole interaction turns out to 
have an energy comparable to the inter- 
action of the magnetic dipole moment 
fr with Be. Nuclei, and atoms, do not 
have static electric dipole moments (for 
states of definite parity). 


That exercise also shows how this rule can be used to deduce F' and 
hence the nuclear spin J from a given hyperfine structure.” 

The hyperfine-structure constant A(n,l,j) is smaller for ] > 0 than 
for 1 = 0 and the same n. Exact calculation shows that the hyperfine- 
structure constants of the hydrogenic levels np *P;/2 and ns *Sj/2 are 
in the ratio 

A(n es /2) x 1 


A(n?Si;2) 3 — 


This ratio is smaller in the alkalis, e.g. ~ 1/10 in the examples below, 
because the closed shells of electrons screen the nuclear charge more 
effectively for p-electrons than for s-electrons. 


6.1.4 Comparison of hyperfine and fine structures 


The analogy between hyperfine and fine structures is summarised in 
Table 6.1. 
For fine structure in the alkalis we found the Landé formula (eqn 4.13) 
272 
Zi Zo a*hcRa - 
(n*) 
The Z* scaling for a hydrogenic system is reduced to Eps « Z? for neu- 
tral atoms since the effective outer atomic number is Zp = 1, and Z; ~ Z 
gives a reasonable approximation in the inner region. Applying similar 
considerations to the hyperfine structure shows that the dependence on 
Z® in eqn 6.10 reduces to 


Eps ~ (6.16) 


(6.17) 


The mass ratio arises from un/wB = Me/M,. Hyperfine structure scales 
as Z, whereas fine structure scales as Z?; thus Eqyps varies much less 
than Eps, as the following comparison of the splittings for Na and Cs 
shows. 


Nel f= Il 


E(3p *P3/2) — E(3p PP Iya); 
Afrs = 510 GHz 


For the ground state 3s 7S1/2, 


A furs = 1.8 GHz 


For 3p eae 
A furs = 0.18 GHz 


CsZe— 250 


E(6p *P3/2) =F E(6p *Pi/2), 
Afrs = 16600 GHz 


For the ground state 6s 7S; 2, 


A furs = 9.2 GHz 


For 6p Glaay ey 
A furs = 1.2 GHz 


The hyperfine splitting of the ground states and the fine-structure 
splitting of the first excited states are indicated on the plot of energies 
against Z in Fig. 6.5. The values shown are only a guideline; e.g. different 


e Energy (eV) @ Gross structure 
Bi Residual electrostatic energy 
a Fine structure (first excited state) 
ee ® A Hyperfine structure (ground state) 
10 4@ 


LO 


10-2 


10-3 


10> 


10-6 
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Fig. 6.5 A logarithmic plot of the 
energy of various structures against 
atomic number Z; the hyperfine split- 
ting of the ground state is plotted with 
data from Fig. 5.7. All the points are 
close to the maximum values of that 
quantity for low-lying configurations, 
terms, levels and hyperfine levels (as 
appropriate) of neutral atoms with one 
or two valence electrons, and these il- 
lustrate how these quantities vary with 
Z. This is only a rough guideline in 
particular cases; higher-lying configura- 
tions in neutral atoms have smaller val- 
ues and in highly-ionized systems the 
structures have higher energies. 
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isotopes of the same element have different hyperfine splittings because 
the magnetic moment py; depends on the nuclear structure. The ground 
state of hydrogen has an especially large hyperfine structure that is 
greater than that of lithium (Z = 3), see Exercise 6.3. 


Example 6.2 Hyperfine structure of europium 
Figure 6.6 shows an experimental trace of a 4f"6s? °S7/2—4f"6s6d °D44/2 
transition in europium obtained by Doppler-free laser spectroscopy (Kro- 
13Two-photon spectroscopy is ex- nfeldt and Weber 1991).'% The ground level (4f"6s? °S7/2) has a small 
plained in Section 8.4 hyperfine structure, from the unpaired f-electrons, that causes the small 
splitting of the peaks labelled 3, 4, 5, 6 and 7 (barely resolved for peak 
3); however, this detail will not be considered further in the following 
141¢ js straightforward to apply the in- analysis,!4 which concentrates on the much larger hyperfine structure of 
terval rule when both the lower and the 4f’6s6d °Dj, /2 level that arises mainly from the unpaired s-electron. 
upper levels have hyperfine structure, The spectrum has a dozen main peaks. Since J = 11/2 a naive anal- 
which is generally the case except in j d ; 
carefully chosen examples such as this. YsiS might suppose that J > J and so the observed peaks arise from 
transitions to the 2 + 1 = 12 hyperfine levels expected in this case, i.e. 
F=I14+J,I+J-1,...,[—J+1andI-—J. This is obviously wrong for 
various reasons: it is clear that the pattern of all twelve peaks does not 
fit any simple rule and also this element has two stable isotopes °!Eu 
and !°3Eu. As indicated by their similar shape, the peaks labelled 3, 4, 
5, 6 and 7 all belong to same isotope (1°! Eu) and this can be verified by 
the interval rule as shown in the following table. 


I I Il IV 
Peak Position (GHz) =*=!—“* (GHz) Ratio of differences, => 
3 21.96 = - - 
4 19.14 2.82 - - 
5 15.61 3.53 1.252 5.0 
6 eS 4.24 1.201 6.0 
7 6.42 4.95 1.167 7.0 
g 0.77 5.65 1.141 8.1 
g f e 7d c b 6a 2 4 3 


| 


0 =) 10 15 20 f (GHz) 


= 


Fig. 6.6 An experimental trace of a 4f"6s? °S7/.-4f6s6d &Dj,;/) transition in europium obtained by Doppler-free laser 
spectroscopy (Kronfeldt and Weber 1991). Copyright 1991 by the American Physical Society. 


Column IT in this table gives the positions of the peaks!® measured 
from Fig. 6.6. Column III gives the difference between the frequencies in 
column IT (the intervals between the peaks), e.g. 21.96 — 19.14 = 2.82. 
Column IV gives the ratio of the intervals in column III, e.g. 3.53/2.82 = 
1.252. The interval rule for hyperfine structure in eqn 6.14 predicts that 


Ep — Ep_, AF F 


i ee AG) = Fai 


(6.18) 


Rearrangement gives the total angular momentum F in terms of z as 


Paes 


eae (6.19) 
The numerical values of this quantity in column V (that have been cal- 
culated from the data by the above procedure) confirm that F’ has the 
value used to label the peaks. Moreover, we find that peak g fits the in- 
terval rule with F' = 8. Thus, since this level has J = 11/2, this isotope 
(15!Bu) must have a nuclear spin of I = 5/2—this follows from the rules 
for the addition of angular momentum which allow values of F' between 
Finnax = 1+ J =8 and Fain = |I — J| = 3.1¢ Exercise 6.5 shows that the 
other six peaks a to f also obey an interval rule and they all belong to 
another isotope (1°°Eu). 


6.2 Isotope shift 


In addition to the (magnetic dipole) hyperfine interaction in eqn 6.8 
there are several other effects that may have a comparable magnitude 
(or might even be larger).!” This section describes two effects that lead 
to a difference in the frequency of the spectral lines emitted by different 
isotopes of an element. 


6.2.1 Mass effects 


In Chapter 1 we saw that, in the Bohr model, energies are proportional 
to the reduced mass of the electron, given in eqn 1.13, and this scal- 
ing also applies to the solutions of the Schrodinger equation. Thus a 
transition between two levels of energies E, and Ez has a wavenumber 
V = (Ey — E,) /he that is related to 1%, the value for a ‘theoretical’ 
atom with a nucleus of infinite mass, by 


Mn 
Me + Mn i 


en (6.20) 
where Mn is the mass of the nucleus. However, ¥.. cannot be measured. 
What we can observe is the difference in wavenumbers between two 
isotopes of an element, e.g. hydrogen and deuterium for Z = 1. In 
general, for two isotopes with atomic masses A’ and A”, we can make 
the approximation My = A'M, or A" Mp, so that}® 
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15The highest peak in the case of the 
closely-spaced pairs. 


16-The proof of this result using opera- 
tors can be found in quantum mechan- 
ics texts. It can be justified by anal- 
ogy with vector addition: the maxi- 
mum value occurs when the two an- 
gular momentum vectors point in the 
same direction and the minimum value 
when they are anti-parallel. 


17 The quadrupole interaction was 
noted in Table 6.1, but will not be dis- 
cussed further. 


18 strictly, atomic mass units should be 
used rather than Mp. The difference 
between the mass of an atom and its 
nucleus equals the mass of the electrons 
including the contribution from their 
binding energy. However, for this esti- 
mate we do not need to know My pre- 
cisely. 
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19 See Exercise 6.12 and also Woodgate 
(1980). 


20 For hyperfine structure we were con- 
cerned with the nuclear magnetic mo- 
ment arising from the constituent pro- 
tons and neutrons. ‘To calculate the 
electrostatic effect of a finite nuclear 
size, however, we need to consider the 
charge distribution of the nucleus, i.e. 
the distribution of protons. This has 
a shape that is similar to, but not 
the same as, the distribution of nuclear 
matter (see nuclear physics texts). The 
essential point for atomic structure is 
that all the nuclear distributions extend 
over a distance small compared to the 
electronic wavefunctions, as illustrated 
Wl JONES (OM 


21 1¢ this worries you, then put an arbi- 
trary constant ¢o in the equation, aris- 
ing from the integration of the electric 
field to give the electrostatic potential, 
and show that the answer does not de- 
pend on ¢o. 


pes Vise 


1+m,/A"M, ~ 14m./A'M, 


~D | ees | else 
ae A"M, | A'M, 


Avass = Van oc Var = 


(6.21) 


This is called the normal mass shift and the energy difference hcAV ass 
is plotted in Fig. 6.7, assuming that 6A = 1, A ~ A’ ~ 2Z, and that 
E> — E, ~ 2eV for a visible transition. The mass shift is largest for 
hydrogen and deuterium where A” = 2A’ ~ 2M, (Exercise 1.1); it is 
larger than the fine structure in this case. For atoms with more than 
one electron there is also a specific mass shift that has the same order of 
magnitude as the normal mass effect, but is much harder to calculate.’ 
Equation 6.20 shows that the mass shift always leads to the heavier 
isotope having a higher wavenumber—by definition the reduced mass of 
the electron is less than me, and as the atomic mass increases the energy 
levels become closer to those of the theoretical atom with a nucleus of 
infinite mass. 


6.2.2 Volume shift 


Although nuclei have radii which are small compared to the scale of 
electronic wavefunctions, ry < ao, the nuclear size has a measurable 
effect on spectral lines. This finite nuclear size effect can be calculated 
as a perturbation in two complementary ways. A simple method uses 
Gauss’ theorem to determine how the electric field of the nuclear charge 
distribution differs from —Ze/4megr? for r < ry (see Woodgate 1980). 

Alternatively, to calculate the electrostatic interaction of two overlap- 
ping charge distributions (as in eqn 3.15, for example) we can equally 
well find the energy of the nucleus in the potential created by the elec- 
tronic charge distribution (in an analogous way to the calculation of the 
magnetic field at the nucleus created by s-electrons in Section 6.1.1). 
The charge distributions for an s-electron and a typical nucleus closely 
resemble those shown in Fig. 6.2.78 In the region close to the nucleus 
there is a uniform electronic charge density 


Pe = —e | (0)|? . 


Using Gauss’ theorem to find the electric field at the surface of a sphere 
of radius r in a region of uniform charge density shows that the electric 
field is proportional to r. Integration gives the electrostatic potential: 


(6.22) 


EE 
6€o 


Oe (r) = (6.23) 
The zero of the potential has been chosen to be ¢- (0) = 0. Although this 
is not the usual convention the difference in energy that we calculate does 
not depend on this choice.” With this convention a point-like: nucleus 
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has zero potential energy, and for a distribution of nuclear charge pn (r) 
the potential energy is 


Fva= [ff odode =< wor ff f pwr? ars 


Ze? 

= =— WWF (rk). a) 
€0 

The integral gives the mean-square charge radius of the nucleus (r2) 

times the charge Ze. This volume effect only applies to configurations 

with s-electrons. The liquid drop model gives a formula for the radius 
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Fig. 6.7 A plot of the energy of var- 
ious structures against atomic number 
Z. This includes the data shown on 
Fig. 6.5, but the scale has been changed 
to show the effects of lower energy. 
The contributions to the isotope shift 
from the mass and volume effects are 
plotted using eqns 6.21 and 6.26, re- 
spectively. Although these effects arise 
from completely independent physical 
effects, they have a similar magnitude 
for medium-heavy elements (and can 
be comparable to the magnetic dipole 
hyperfine structure of an excited state, 
which is considerably smaller than the 
ground-state splitting). The Doppler 
width for a visible transition with en- 
ergy 2eV is plotted to indicate the 
typical experimental limit in spectro- 
scopic measurements using light; the 
line has a slope of —1/2 because Afp « 
1/VM and the atomic mass A is ap- 
proximately A ~ 2Z, so that Afp « 
Z~1/2 (see eqn 8.7). For hydrogen 
the Doppler width is almost equal to 
the fine-structure splitting (and much 
larger than hyperfine structure). The 
Doppler broadening of visible lines in 
sodium is comparable with its hyper- 
fine structure; for heavier elements the 
ground-state hyperfine structure can be 
resolved in a vapour at room tempera- 
ture. This gives an indication of the 
importance of Doppler-free techniques 
described in Chapter 8. The Zeeman 
effect of a magnetic flux density of B = 
1T is equivalent to a frequency shift of 
14 GHz. 
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22More accurate calculations can be 
made directly from eqn 6.24 in individ- 
ual cases, e.g. the 1s configuration in 
hydrogen has 


4 Pe 
By = (rx) heRoo =~ 5 X 107% eV. 
3 ag 


The proton has a root-mean-square 
charge radius of (2)? = 0.875fm 
(CODATA value). 


23The size of the nucleus (r2,) gen- 
erally increases with A following the 
trend in eqn 6.25 but there are excep- 
tions, e.g. a nucleus that is particularly 
stable because it has closed shells of 
nucleons can be smaller than a lighter 
nucleus. Experimental measurements 
of isotope shifts, and the deduced val- 
ues of the volume effect, are used to 
study such behaviour. (Similarly, for 
atoms, the shell structure makes inert 
gas atoms exceptionally small. More 
generally, the variation of atomic size 
with atomic mass is opposite to that 
of ionization energy—alkali atoms are 
larger than nearby atoms in the peri- 
odic table.) 


24 S0@ Woodgate (1980) for further de- 
tails. 


29 High-energy electron scattering ex- 
periments also probe the nuclear charge 
distribution. 


of a nucleus as 


ry ~ 1.2 x A/? fm. (6.25) 


Using this equation and making the same approximations for the wave- 
function squared as in hyperfine structure (eqn 6.17), we can write the 
isotope shift caused by the volume effect as 


_ AEvo _ (tN) dA 2? 


Arvo hp a az A (n*)° 


ae (6.26) 


This has been used to plot Ayo) as a function of Z in Fig. 6.7, assuming 
that 6A = 1, A~ 2Z and n* ~ 2.” 

This volume effect decreases the binding energy of a given atomic level 
with respect to that of a ‘theoretical’ atom with a point charge. The 
resulting change in the transition depends on whether the effect occurs 
in the upper or lower level (see Exercise 6.9).7% 


6.2.3. Nuclear information from atoms 


We have shown that the nucleus has an observable effect on atomic spec- 
tra. If hyperfine structure is observed then one immediately knows that 
the nucleus has spin and the number of hyperfine components sets a 
lower limit on J (Example 6.1). The values of F’, and hence J, can be 
deduced by checking the interval rule, and the sum rule for relative inten- 
sities (similar to that for fine structure in Section 4.6.1). In principle, the 
magnetic moment of the nucleus jz; can be deduced from the hyperfine- 
structure constant A, e.g. calculations such as that in Section 6.1.1 are 
accurate for light atoms. For atoms with a higher Z, the relativistic ef- 
fects are important for the electronic wavefunction near the nucleus and 
it is more difficult to calculate |w (0)|?. However, the electronic factors 
cancel in ratios of the hyperfine-structure constants of isotopes of the 
same element to give accurate ratios of their magnetic moments, i.e. if 
the yz; is known for one isotope then it can be deduced for the other 
isotopes (see Exercise 6.4). 

Similarly, isotope shifts give the difference in the nuclear sizes between 
isotopes, A (rx), assuming that the mass effects are calculable.24 To 
interpret this information, it is necessary to know the absolute value of 
the charge radius for one of the isotopes by another means, e.g. muonic 
X-rays. These transitions between the energy levels of a muon bound to 
an atomic nucleus have a very large volume effect, from which Cr) can 
be deduced (see Exercise 6.13).?° 


6.3. Zeeman effect and hyperfine structure 


The treatment of the Zeeman effect on hyperfine structure (in the I J- 
coupling scheme) closely resembles that described in Section 5.5 for the 
LS-coupling scheme, and the detailed explanation of each step is not 
repeated here. The total atomic magnetic moment of the atom is the 
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sum of the electronic and nuclear moments (cf. eqn 5.9): 


HMatom = —gs BI + 9I LN ieee —gj PBI - (6.27) 


Since un < fp we can neglect the nuclear contribution (for all but the 
most precise measurements), so that the Hamiltonian for the interaction 
with an external field B is just that for the electronic magnetic moment: 


H = 97/13) -B. (6.28) 


This interaction does not depend on the nuclear spin. However, its 
expectation value does depend on the hyperfine structure. We consider 
first the weak-field regime where the interaction with the external field 
is weaker than AI-J so that it can be treated as a perturbation to the 
hyperfine structure. We then treat the strong-field regime, and also the 
intermediate situation. 


6.3.1 Zeeman effect of a weak field, upB< A 


If the interaction with the external field in eqn 6.28 is weaker than the J 
hyperfine interaction AI-J, then in the vector model J and I move 
rapidly about their resultant F, as illustrated in Fig. 6.8, whilst F itself 
precesses more slowly about the magnetic field (z-axis). In this regime 
F and Mr are good quantum numbers, but M; and M, are not. Tak- 
ing the projection of the magnetic moments along F gives the effective 
Hamiltonian 


— 


a gruBk - B= grppB F. , (6.29) 


igs = 9EB ROR +1) 


Fig. 6.8 The /J-coupling scheme. 


where 
Ci eo +h) (1 al) 


2F (F +1) 
Here the factor gr arises from the projection of J onto F, as illustrated 
in Fig. 6.9, in the just same way as gy is given by the projection of L 
and S onto J in Section 5.5. The Zeeman energy is 


9F = GJ - (6.30) 


p= grupBMr . (6.31) 


As an example, consider the ground-state hyperfine levels in hydrogen 
(= J=1/2 and g3 = 9, ~ 2). For F = 1 we find gr = 1 so the three 
states Mp = —1, 0 and 1 are spaced by wp B. The F = 0, Mr = 0 state 
has no first-order Zeeman shift (see Fig. 6.10). 

In summary, the calculation of the Zeeman effect of a weak magnetic 
field on the hyperfine structure is simple because only the magnetic mo- 
ment of the electron(s) along J contributes, whereas in the L.S-coupling 
scheme there are components along both L and 8S. However, I affects gr 
because the nuclear angular momentum I is not small, and has a major 
effect on the J.J F-triangle (thinking in terms of vectors as in Fig. 6.9), 
even though the nuclear magnetic moment is negligible. 
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Fig. 6.9 The projection of the contri- 
butions to the magnetic moment from 
the atomic electrons along F. The mag- 
netic moment of the nucleus is negligi- 
ble in comparison. 


Fig. 6.10 The Zeeman effect on the 
hyperfine structure of the ground level 
of hydrogen 1s 7S;/2. The interval be- 
tween the F = 0 and F = 1 levels 
is A, as shown in Fig. 6.3. The zero 
of the energy scale has been chosen to 
be midway between the levels at zero 
field, which is convenient for the cal- 
culations in the text. (The two states 
with Mp = 0 in the low-field regime 
are mixed by the perturbation and then 
move apart as the magnetic flux density 
increases.) The quantity x = wp B/A is 
plotted on the horizontal axis and the 
low- and high-field regimes correspond 
tolni<< anda > 1. 


26But smaller than the fine structure 
of the first excited state, except for 
the lightest elements. High-lying levels, 
however, have smaller fine structure. 


wily tele 


a 
Low field B High field 


6.3.2 Zeeman effect of a strong field, uzpB > A 


A strong field is one where the interaction with the external field, in 
eqn 6.28, is greater than AI-J. This can readily be achieved for hy- 
perfine structures since the Zeeman energy ppB in a field of 1T is 
about 6 x 10~° eV (corresponding to a frequency of 14GHz), which is 
greater than all but the largest hyperfine structures in the ground con- 
figurations of heavy elements (see Fig. 6.7).2° In this regime F is not a 
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good quantum number and J precesses about B.?” The effect of the hy- 
perfine interaction can be calculated as a perturbation on the |[M;JM,) 
eigenstates, i.e. 
Eze = gsppBM, + (IM, JMj| Al-J |[M; JM) 
=gs~pBM;+AM,M,. 


(6.32) 
(6.33) 


The first term is the same as eqn 5.11. In the second term, I- J = 
I,J, + IyJy +I,Jz and the z- and y-components average to zero in the 
precession about the field along the z-direction.® 

An example of the energy levels in a strong field is shown in Fig. 6.10 
for the hydrogen ground state. The two energy levels with My; = +1/2 
are both split into sub-levels with M; = +1/2 by the hyperfine inter- 
action; eqn 6.33 shows that these sub-levels have a separation of A/2 
(independent of the field strength). 


6.3.3 Intermediate field strength 


In Fig. 6.10 the low- and high-field energy levels follow the rule that two 
states never cross if they have the same value of M, where at low fields 
M = Mf and at high fields M = M; + Mj. This implies that 


M; M; 
foi Mp=0 3) 41/2) -=1/2: 
0 Mp=O. == 1/2, 21/2) 


This rule can be justified by showing that the operator J, + J, com- 
mutes with all the interactions and it allows unambiguous connection 
of states even in more complex cases.”® For the simple case of hydrogen 
the energy levels can be calculated at all fields by simple perturbation 
theory, as shown below. 


Example 6.3 The Zeeman effect on the hyperfine structure of hydrogen 
for all field strengths 

Figure 6.10 shows the energy levels for all field strengths. The Zeeman 
energies of the M = +1 states are typB for all fields because their 
wavefunctions are not mixed (gr = 1 from eqn 6.30). The Mr = 0 
states have no first-order shift but the magnetic field mixes these two 
states in the F = 0 and 1 hyperfine levels; the matrix element between 
them is 


—~(F =1,Mp =0|u-B|F = 0, Mp = 0) = CupB. 


Such (off-diagonal) matrix elements can be evaluated by angular momen- 
tum theory, but in this simple case we can get by without using Clebsch— 
Gordan coefficients (leaving ¢ as an undetermined constant for the time 
being). The Hamiltonian for the two-level system is 


n= Bie fie iF (6.34) 


CupB —A/2 


iLL 


27The nuclear angular momentum I 
does not precess around B because 
—p,-B is negligible. In this regime the 
interaction AI-J makes I precess about 
the mean direction of J, which is par- 
allel to B. Thus effectively I precesses 
about the axis defined by B (but not 
because of —jz; _B). The vector model 
picture requires careful thought be- 
cause of the subtle differences from the 
Paschen—Back effect (Fig. 5.14). In the 
quantum mechanical description this 
is taken into account by considering 
the relative magnitudes of the pertur- 
bations: |y.-B| > |AI-J| > |u, - Bi, 
where ps, is the magnetic moment of 
the atomic electrons in eqn 5.9. 


28This can be shown rigorously using 
the ladder operators 


ly =I, +il,, 
I_ =I, —-il, , 
and similarly for Ji and J_. These 


ladder operators change the magnetic 
quantum numbers, e.g. 


I. |I Mz) « |\f M; +1). 
Since 
il 
IgJq + Iydy = 5 Chdi, AS Ida). 


the expectation value of this part of I-J 
is zero (for states of given My and M; 
as in eqn 6.32). 


29 At low fields, I, + Jz = F., which 
clearly commutes with the interaction 
in eqn 6.29. At high fields the relevant 
interactions are proportional to Jz and 
Ty Jy + Ty Jy +12Jz, both of which com- 
mute with I, + Jz. 
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30 Any choice for the point at which 
E = 0 leads to the same result, e.g. 
taking the unperturbed energies as A/4 
and —3A/4 as in Fig. 6.3. 


31-The avoided crossing of states that 
mix is a general feature of perturbation 
theory. 


32-The two spins are aligned in a triplet 
state so the 5s- and 6s-electrons pro- 
duce fields in the same direction. 


The energies are measured from the point midway between the hyperfine 
levels to streamline the algebra.2° The energy eigenvalues are 


E= +y/(4/2)? + (CupB)*. (6.35) 


This exact solution for all fields is plotted in Fig. 6.10. The approximate 
solution for weak fields is 


i JA, GupB)’ 
1D G2 Se {3 ae oan : (6.36) 


When B = 0 the two unperturbed levels have energies +A/2. The 
term proportional to B? is the usual second-order perturbation theory 
expression that causes the levels to avoid one another (hence the rule 
that states of the same M do not cross).?! 

For strong fields, where up B > A, eqn 6.35 gives the energy of the 
M = 0 states as 


E(F = +1,Mp =0) ~ +CupB. (6.37) 


In a strong field the energy levels of the system are given by gjupB My, 
and the two M; = +1/2 states have Zeeman energies of gugpBMy = 
+ppB. Comparison with eqn 6.37 shows that ¢ = 1, and so we have 
found the energies for all field strengths. The other two states have 
energies E(M = +1) = $A + ppB for all values of B. 

A similar approach can be used when J = 1/2 for arbitrary values 
of I, which applies to the ground states of the alkalis. For J > 1/2 
there are more states to consider than in hydrogen (where I = 1/2), so 
the Hamiltonian will have larger dimensions than in eqn 6.34. Actually 
there were four basis states to consider in hydrogen, but because the 
perturbation mixes only two of them it was only necessary to diagonalise 
a 2x 2 matrix. 


6.4 Measurement of hyperfine structure 


An apparatus similar to that for measuring the Zeeman effect, shown 
in Fig. 1.7(a), can be used to observe hyperfine structure—a magnet is 
not required because hyperfine structure arises from an internal mag- 
netic field of the atom. Figure 6.11 shows a typical experimental trace 
obtained from such an experiment with a pressure-scanned Fabry—Perot 
étalon for the 5s5p ?Po—5s6s °S, line in cadmium. The ?Pp level has no 
hyperfine structure (because J = 0) and the observed splitting comes en- 
tirely from the 5s6s °S; level. Both s-electrons contribute to the field at 
the nucleus*” so this level has an exceptionally large hyperfine splitting 
that is greater than the Doppler broadening. 

The fact that the J = 1 level only gives two hyperfine levels implies 
that J = 1/2. If J > 1 then there would be three levels (by the rules for 
addition of angular momenta). One of these peaks might, however, be 
hidden underneath the large central peak? That this is not the case here 
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can be checked by verifying that the observed peaks have the expected 
displacements from their centre of gravity (which is approximately at 
the position of the even isotopes).*° 

It can be seen from Fig. 6.11 that optical spectroscopy is not generally 
suitable for measuring the Zeeman effect of hyperfine structure because 
spectral lines have a Doppler broadening that can be comparable with 
the hyperfine splitting. Thus in the low-field regime (upB < A) the 
Zeeman splitting is too small to be resolved. To show this quantitatively 


Fig. 6.11 (a) The 5s6s *S, level in the 
odd isotopes of cadmium (144Cd and 
113Cq) has a large hyperfine structure; 
the F = 3/2 hyperfine level lies be- 
low F = 1/2 because the nuclear g- 
factor is negative and similar in size 
for both odd isotopes. (The total mag- 
netic field created by the unpaired s- 
electrons at the nucleus is anti-parallel 
to J, as in the ground state of hydro- 
gen and the alkalis.) (b) An exper- 
imental trace of the hyperfine struc- 
ture for the 5s5p 3P9—5s6s 3S, line 
at a wavelength of 468nm obtained 
with a pressure-scanned Fabry Perot 
étalon (as in Fig. 1.7). There are three 
peaks in each order of the étalon: a 
large peak from the isotopes with no 
nuclear spin J = 0 and hence no hyper- 
fine structure (generally isotopes with 
an even number of nucleons so there are 
no unpaired spins within the nucleus); 
and two smaller peaks whose separa- 
tion equals the hyperfine splitting of 
the 5s6s 3S, level. (Data from the Ox- 
ford Physics Teaching Laboratory prac- 
tical course; further details are in Lewis 
(1977).) 


33 This is not straightforward in this ex- 
ample because of the overlap of lines, 
but this could be done using a curve- 
fitting program on a computer. Addi- 
tionally, there is a sum rule for the in- 
tensities as in fine structure. 
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34-This gives a value of Afp that differs 
by only Vln2 = 0.8 from the exact re- 


sult for the full width at half maximum 
(FWHM) derived in Chapter 8. 


35The Zeeman splitting of a spectral 
line can only be resolved when the field 
is strong enough to give the Paschen— 
Back effect. 


Fig. 6.12 Doppler-free laser spectro- 
scopy of the 5p? ?Po-5p6s 2P, line of 
tin; the reduction in Doppler broad- 
ening (cf. Fig. 6.11) reveals the iso- 
tope shifts between the even isotopes, 
in addition to the hyperfine splitting 
of the odd isotopes. Each peak is 
labelled by the relative atomic mass. 
Each odd isotope gives rise to two peaks 
because of hyperfine structure (as in 
Fig. 6.11). For further details see Baird 
et al. (1983). 


we consider the Doppler width A fp of a line of wavelength 4: 


Afo~ >, 
where u is a typical atomic velocity. The factor of 2 appears because 
atoms can move towards and away from the observer. For this estimate 
we shall use the most probable velocity u = \/2kpT/m.** For cadmium 
at T = 300K, we have u ~ 200ms~! and thus the Doppler width of 
the lines with a wavelength of \ = 468nm is Afp = 2u/A = 0.9GHz, 
whereas A = 7.9 GHz for the 5s6s 2S; level. More generally, the Doppler 
width for a visible transition is plotted in Fig. 6.7 as a function of the 
atomic number. At room temperature the optical transitions of hydro- 
gen, the lightest element, have a Doppler width slightly less than the 
fine-structure splitting of the first excited state, so in this case the low- 
field Zeeman effect cannot be observed even for fine structure (let alone 
hyperfine structure).?° Figure 6.12 shows the results of an experimen- 
tal observation of the hyperfine structure and isotope shift of tin by a 
technique of Doppler-free laser spectroscopy (that will be described in 
Section 8.2). 

Doppler broadening is much less of a problem in direct measurements 
of the separation between hyperfine levels with microwave techniques 
(at frequencies of gigahertz), or the even smaller splitting of the Zeeman 
sub-levels that correspond to radio-frequency transitions. An example 
of a radio-frequency and microwave spectroscopy technique is outlined 
in the next section. 


(6.38) 


6.4.1 The atomic-beam technique 


An atomic-beam apparatus can be understood as an extension of the 
original Stern—Gerlach apparatus illustrated in Fig. 6.13. In the original 
Stern—Gerlach experiment a beam of silver atoms was sent through a 
region of strong gradient of the magnetic field and further downstream 
the atoms were deposited on a glass plate. Upon inspecting the plate, 
Stern and Gerlach found that the atoms appeared in two distinct places, 
showing that the atomic beam was split into two directions. This told 
them that angular momentum is quantised and that it can have values 


o 
N 


Signal 


oo 
a 
_ 


Frequency 


6.4 Measurement of hyperfine structure 


Beam source 


Collimator 
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that are half-integer multiples of h. (Orbital angular momentum would 
give 21 + 1 components, with / an integer.) To interpret the experiment 
we consider an atom in a magnetic field B with a gradient along the 
z-axis. The force exerted on the atom is (using eqn 5.11) 


de = a0 GMs. 
The ground level of silver, 5s "81/25 has:t = Vand J = 3-1/2) “Thus 
the force on atoms passing through the Stern—Gerlach magnet has the 
two values F = +ypdB/dz for m, = My = +3. This explains the 
separation of the atomic beam into two components and provides direct 
observational evidence for quantisation since the two emerging beams are 
well defined. For a classical vector the z-component of angular momen- 
tum would be spread over a range between the maximum and minimum 
values. 

An atomic-beam apparatus has two Stern—Gerlach-type magnets, as 
shown in Fig. 6.14, that create an inhomogeneous field in the A and B 
regions to deflect atoms according to their Mj; state. The A and B re- 
gions have a strong field (upB > A), associated with the high gradient 
of magnetic field there, so M, is a good quantum number and eqn 6.39 
gives the force. The transition to be measured occurs in the C region 
between the two state-selecting magnets where, generally, the atoms 
are in the low-field regime. The principles of the operation of such an 


Force = — 2 (6.39) 
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Fig. 6.13 In the original Stern—Gerlach 
apparatus a beam of silver atoms was 
sent through a region of strong gradi- 
ent of the magnetic field and further 
downstream the atoms were deposited 
on a glass plate. This simple experi- 
ment had a great influence on the devel- 
opment of quantum ideas and provides 
a very useful illustration of the concepts 
of quantisation (Feynman et al. 1963- 
1965). 


36 since V - B=0 we cannot have a 
gradient along the z-axis without there 
also being a gradient in another direc- 
tion, but the effect of a gradient in a di- 
rection perpendicular to B averages to 
zero as the magnetic moment precesses 
around the z-axis—for a more rigorous 
discussion see Chapter 10. 
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Fig. 6.14 (a) The magnetic resonance technique in an atomic beam. Atoms emerge from an oven and travel through the 
collimating slit ‘s’ to the detector. The deflection of atoms by the magnetic field gradient in the A and B regions depends on 
M7, as indicated. (b) Atoms that stay in the same Mj state are refocused onto the detector when the A and B regions have 
gradients in opposite directions. (c) Resonant interaction with radio-frequency radiation in the C region can change the M, 
quantum number, Mj; = +5 4 My = -4, so that atoms no longer reach the detector. (In a real apparatus the C region may 
be up to several metres long.) This is known as the flop-out arrangement and gives a signal as in (d). Further details are given 
in the text and in Corney (2000). 
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atomic-beam experiment are as follows. 


e Atoms emerge from an oven to form an atomic beam in an evacuated 
chamber. The atoms have a mean free path much greater than the 
length of the apparatus (i.e. there are no collisions). 


e The deflection of atoms in the A and B regions depends on M,. If 
these two regions have magnetic field gradients in the same direction, 
as indicated in Fig. 6.15, then atoms only reach the detector if their 
M, quantum number changes in the C region, i.e My = +3 4 My; = 
—4. This is known as the flop-in arrangement.°” 


e As the atoms travel from A into the C region their state changes 
adiabatically to that in a low magnetic field as shown in Fig. 6.16 
(and similarly as the field changes between the C and B regions). The 
transitions in the low-field region that can be observed are those that 
connect states with different values of M, in the high-field regions, 
e.g. the transitions: 


low frequency (AF = 0): 
P= =O ¢ F=1, Me=>—1 with AL =grpB; 
higher frequency (AF = +1): 


avin — Oe f= t Me—0 with AL=A, 
©=0, Mrp=0 © FH=1, Mrp=1 with AR=A-+ grjipB. 


In this example, the My = 1 < Mp = 0 change between the Zee- 
man sub-levels of the upper F’ = 1 level cannot be detected, but this 
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othe flop-out arrangement is shown in 
Fig. 6.14. 


Fig. 6.15 (a) The trajectories of atoms in an atomic-beam apparatus similar to that shown in Fig. 6.14, but magnetic field 
gradients in the A and B regions have the same direction. Atoms only reach the detector if their Mz quantum number is 
changed in the C region by the interaction with radio-frequency radiation. This is known as the flop-in arrangement and gives 


a signal as in (b). 
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Fig. 6.16 The method of magnetic res- 
onance in an atomic beam detects tran- 
sitions that occur at low field (in the C 
region shown in Fig. 6.14) which cause 
a change in the quantum number M, 
at high field (A and B regions). For ex- 
ample, an atom that follows the path 
indicated by the dotted line may start 
in the state M7 = +3, go adiabatically 
into the state F = 1,My = 0 in the 
C region, where it undergoes a transi- 
tion to the state with Mp = —1 and 
then end up in the state Mz; = —4 
in the B region (or it may follow the 
same path in the opposite direction). A 
strong magnetic field gradient (which is 
associated with a high field) is required 
in the A and B regions to give an ob- 
servable deflection of the atomic trajec- 
tories. 


38 Such quantum metrology has been 
used to define other fundamental con- 
stants, with the exception of the kilo- 
gram which is still defined in terms of 
a lump of platinum kept in a vault in 
Paris. 


39The next chapter gives a complete 
treatment of the interaction of atoms 
with radiation, 


a 
Low field B High field 
C region A and B regions 


does not lead to any loss of information since the hyperfine-structure 
constant A and gr can be deduced from the other transitions. 


These transitions at microwave- and radio-frequencies are clearly not 
electric dipole transitions since they occur between sub-levels of the 
ground configuration and have Al = 0 (and similarly for the maser). 
They are magnetic dipole transitions induced by the interaction of the 
oscillating magnetic field of the radiation with the magnetic dipole of 
the atoms. The selection rules for these M1 transitions are given in 
Appendix C. 


6.4.2 Atomic clocks 


An important application of the atomic-beam technique is atomic clocks, 
that are the primary standards of time. By international agreement the 
second is defined to be 9 192631770 oscillation periods of the hyperfine 
frequency in the ground state of '%°Cs (the only stable isotope of this 
element). Since all stable caesium atoms are identical, precise measure- 
ments of this atomic frequency in the National Standards Laboratories 
throughout the world should agree with each other, within experimental 
uncertainties.°° 

The definition of the second is realised using the hyperfine frequency 
of the F = 3,Mrp = 00 F = 4, Mp = 0 transition in caesium. This 
transition between two Mr = 0 states has no first-order Zeeman shift, 
but even the second-order shift has a significant effect at the level of 
precision required for a clock. The apparatus can be designed so that the 
dominant contribution to the line width comes from the finite interaction 
time 7 as atoms pass through the apparatus (transit time). Fourier 
transform theory gives the line width as®° 


1 eam 
oe ~beam (6.40) 


I ? 
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where Upeam is the typical velocity in the beam and ! is the length of 
the interaction region. Therefore the atomic beams used as primary 
standards are made as long as possible. An interaction over a region 
2m long gives a line width of Af = 100Hz.*° Thus the quality factor is 
f/Af ~ 10°; however, the centre frequency of the line can be determined 
to a small fraction of the line width.*! As a result of many years of 
careful work at standards laboratories, atomic clocks have uncertainties 
of less than 1 part in 10!*. This illustrates the incredible precision of 
radio-frequency and microwave techniques, but the use of slow atoms 
gives even higher precision, as we shall see in Chapter 10.4? There is 
a great need for accurate timekeeping for the synchronisation of global 
telecommunications networks, and for navigation both on Earth through 
the global positioning system (GPS) and for satellites in deep space. 
The atomic-beam technique has been described here because it fur- 
nishes a good example of the Zeeman effect on hyperfine structure, and 
it was also historically important in the development of atomic physics. 
The first atomic-beam experiments were carried out by Isador Rabi and 
he made numerous important discoveries. Using atomic hydrogen he 
showed that the proton has a magnetic moment of 2.8 un, which was 
about three times greater than expected for a point-like particle (cf. the 
electron with jp, i.e. one unit of relevant magnetic moment). This was 
the first evidence that the proton has internal structure. Other impor- 
tant techniques of radio-frequency spectroscopy such as optical pumping 
are described elsewhere, see Thorne et al. (1999) and Corney (2000). 


Further reading 


Further details of hyperfine structure and isotope shift including the 
electric quadrupole interaction can be found in Wood (1980). The dis- 
cussion of magnetic resonance techniques in condensed matter (Blundell 
2001) gives a useful complement to this chapter. 

The classic reference on atomic beams is Molecular beams (Ramsey 
1956). Further information on primary clocks can be found on the web 
sites of the National Physical Laboratory (for the UK), the National 
Institute of Standards and Technology (NIST, in the US), and similar 
sites for other countries. The two volumes by Vannier and Auduoin 
(1989) give a comprehensive treatment of atomic clocks and frequency 
standards. 
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40Caesium atoms have velocities of 
00s = (3kpT/Mos)!/?2 = 210ms7! at 
TE = BAKO), 


41yn a normal experiment it is hard to 
measure the centre of a line with an un- 
certainty of better than one-hundredth 
of the line width, i.e. a precision of only 
1 in 10!° in this case. 


42 The hydrogen maser achieves a long 
interaction time by confining the atoms 
in a glass bulb for tT ~ 0.158 to give a 
line width of order 10 Hz—the atoms 
bounce off the walls of the bulb with- 
out losing coherence. Thus masers can 
be more precise than atomic clocks but 
this does not mean that they are more 
accurate, i.e. the frequency of a given 
maser can be measured to more dec- 
imal places than that of an atomic- 
beam clock, but the maser frequency 
is shifted slightly from the hyperfine 
frequency of hydrogen by the effect of 
collisions with the walls. This shift 
leads to a frequency difference between 
masers that depends on how they were 
made. In contrast, caesium atomic 
clocks measure the unperturbed hyper- 
fine frequency of the atoms. (The use 
of cold atoms improves the performance 
of both masers and atomic clocks—the 
above remarks apply to uncooled sys- 
tems.) 
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Exercises 


(6.1) 


(6.2) 


The magnetic field in fine and hyperfine structure 
Calculate the magnetic flux density B at the cen- 
tre of a hydrogen atom for the Is 7S, /2 level and 
also for 2s aS iva. 

Calculate the orbital magnetic field experienced by 
a 2p-electron in hydrogen (eqn 2.47). 


Hyperfine structure of lithium 

The figure shows the energy levels of lithium in- 
volved in the 2s—2p transitions for the two isotopes 
SLi and "Li. (The figure is not to scale.) 

Explain in simple terms why the hyperfine split- 
ting is of order me/mp smaller than the fine- 
structure splitting in the 2p configuration of 
lithium. 

Explain, using the vector model or otherwise, why 
the hyperfine interaction splits a given J level into 
2J + 1 hyperfine levels if J < I and 22 +1 levels 
if I < J. Hence, deduce the nuclear spin of °Li 
and give the values of the quantum numbers L, J 
and F for all its hyperfine levels. Verify that the 
interval rule is obeyed in this case. 


a 


10.1 GHz 


228.2 MHz 


(6.3) 


(6.4) 


Determine from the data given the nuclear spin of 
"Li and give the values of L, J and F for each of 
the hyperfine levels on the figure. Calculate the 
hyperfine splitting of the interval marked X. 

(For the hyperfine levels a to d the parameter Ap; 
is positive for both isotopes.) 


Hyperfine structure of light elements 

Use the approximate formula in eqn 6.17 to esti- 
mate the hyperfine structure in the ground states 
of atomic hydrogen and lithium. Comment on the 
difference between your estimates and the actual 
values given for hydrogen in Section 6.1.1 and for 
Li in Exercise 6.2. 

Ratio of hyperfine splittings 

The spin and magnetic moment of the proton are 
(1/2, 2.79un), of the deuteron (1,0.857un) and of 
3He (1/2,—2.13yn). Calculate the ratio of the 
ground-state hyperfine splittings of (a) atomic hy- 
drogen and deuterium and (b) atomic hydrogen 
and the hydrogen-like ion *Het. 


TEA 


10.1 GHz 
d 
X 
Cc 
671 nm 
b 
803.5 MHz 


(6.5) 


(6.6) 


(6.7) 


Interval for hyperfine structure 


(a) Show that an interaction of the form AI- J 
leads to an interval rule, i.e. the splitting be- 
tween two sub-levels is proportional to the to- 
tal angular momentum quantum number F of 
the sub-level with the larger F’. 


(b) 
I I 
Peak Position (GHz) 

a 11.76 
b 10.51 
c 8.94 
d 7.06 
e 4.86 
33 2.39 


The table gives the positions of the six peaks 
in the spectrum shown in Fig. 6.6 that were 
not assigned quantum numbers in Exam- 
ple 6.1. It is the hyperfine structure of the 
upper level (°Dj1/2) of the transition in the 
isotope ‘°?Eu that determines the positions of 
these six peaks. What is the nuclear spin I of 
this isotope? Show that the spacing between 
these peaks obeys an interval rule and deter- 
mine the quantum number F associated with 
each peak. 


(c) For the isotope '°'Eu, whose hyperfine struc- 
ture was analysed in Example 6.1, the lower 
level of the transition has a hyperfine struc- 
ture constant of A(°S7/2) = 20MHz (mea- 
sured by the method of magnetic resonance 
in an atomic beam (Sandars and Woodgate 
1960)). What is the hyperfine-structure con- 
stant of this S72 level for the isotope '°°Eu, 
analysed in this exercise? 


Interval for hyperfine structure 
The 3d°4s4p °Pz/2 level of °°Mn is split by hy- 
perfine interaction into six levels that have sepa- 
rations 2599, 2146, 1696, 1258 and 838 MHz. De- 
duce the nuclear spin of °°>Mn and show that the 
separations confirm your value. 


Hyperfine structure 

When studied by means of high-resolution spectro- 
scopy, the resonance line 4p *P; /2-48 ase j2 of 
naturally-occurring potassium consists of four 
components with spacings and intensity ratios as 
shown in the following diagram. 
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Natural potassium is a mixture of °K and *!K in 
the ratio 14: 1. Explain the origin of the struc- 
ture, and deduce the nuclear spins and the ratio 
of the magnetic moments of the two isotopes. 


Energy A 


(6.8) Zeeman effect on HFS at all field strengths 


The figure shows the hyperfine structure of the 
ground level (5s *S,/2) of 37Rb (which has A/h = 
3.4GHz), as a function of the magnetic flux den- 
sity B. 


(a) Deduce the nuclear spin of this isotope of ru- 
bidium. 

(b) What are the appropriate quantum numbers 
for the states in both strong and weak fields 
(mark these on a copy of the figure)? 


(c) Show that in the weak-field regime the sepa- 
ration between states is the same in the upper 
and lower hyperfine levels. 


(d) In a strong field the energy of the states is 
given by eqn 6.33. Show that in this regime 
the four uppermost states have the same sepa- 
ration between them (marked A on the figure) 
as the four lower-lying states. 
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(6.9) 


(6.10) 


(6.11) 


(6.12) 


(e) Define what is meant by a ‘strong field’ when 
considering hyperfine structure. Give an ap- 
proximate numerical value for the magnetic 
field at which the cross-over from the weak- 
field to the strong-field regime occurs in this 
example. 


Isotope shift 

Estimate the contributions to the isotope shift be- 
tween §°Rb and $°Rb that arise from the mass and 
volume effects for the following transitions: 


(a) 5s-5p at a wavelength of ~ 790 nm; and 
(b) 5p-7s at a wavelength of ~ 730 nm. 


Estimate the total isotope shift for both transi- 
tions, being careful about the sign of each contri- 
bution. 


Volume shift 

Calculate the contribution of the finite nuclear size 
effect to the Lamb shift between the 2p ?P, /2 and 
2s 7S, /2 levels in atomic hydrogen (using the infor- 
mation in Section 6.2.2). The measured value of 
the proton charge radius has an uncertainty of 1% 
and the Lamb shift is about 1057.8 MHz. What 
is the highest precision with which experimental 
measurement of the Lamb shift can test quantum 
electrodynamics (expressed as parts per million)? 


Isotope shift 

Estimate the relative atomic mass A for which the 
volume and mass effect give a similar contribution 
to the isotope shift. 


Specific mass shift 
An atom with a nucleus of mass My and N elec- 


(6.13) 


trons has a kinetic energy T given by 


N 2 
P; 

; 2™Me i 
ql 


where pn is the momentum of the nucleus and p; 
is the momentum of the ith electron. The total of 
these momenta is zero in the centre-of-mass frame 
of the atom: 


N 
prt+)> pi = (1). 
a 


Use this equation to express T in terms of elec- 
tronic momenta only. 

Answer the following for either (a) a lithium atom 
(with N = 3) or (b) the general case of a multi- 
electron atom with a nucleus of finite mass (i.e. 
any real non-hydrogenic atom). Find the kinetic- 
energy terms that are ~ m-/Mn times the main 
contribution: a normal mass effect (cf. eqn 6.21) 
and a specific mass effect that depends on prod- 
ucts of the momenta p; - p;. 


Muonic atom 

A muon of mass m, = 207me is captured by an 
atom of sodium (Z = 11). Calculate the radius of 
the muon’s orbit for n = 1 using Bohr theory and 
explain why the atomic electrons have little influ- 
ence on the energy levels of the muonic atom. Cal- 
culate the binding energy of the muon for n = 1. 
Determine the volume effect on the 1s—2p transi- 
tion in this system; express the difference between 
the frequency of the transition for a nucleus with 
a radius rn (given by eqn 6.25) and the theoretical 
frequency for rn = 0 as a fraction of the transition 
frequency. 


The interaction of atoms 
with radiation 


To describe the interaction of a two-level atom with radiation we shall 
use a semiclassical treatment, i.e. the radiation is treated as a classical 
electric field but we use quantum mechanics to treat the atom. We shall 
calculate the effect of an oscillating electric field on the atom from first 
principles and show that this is equivalent to the usual time-dependent 


perturbation theory (TDPT) summarised by the golden rule (as men-— 


tioned in Section 2.2). The golden rule only gives the steady-state transi- 
tion rate and therefore does not describe adequately spectroscopy exper- 
iments with highly monochromatic radiation, e.g. radio-frequency radia- 
tion, microwaves or laser light, in which the amplitudes of the quantum 
states evolve coherently in time. In such experiments the damping time 
may be less than the total measurement time so that the atoms never 
reach the steady state. 

From the theory of the interaction with radiation, we will be able to 
find the conditions for which the equations reduce to a set of rate equa- 
tions that describe the populations of the atomic energy levels (with a 
steady-state solution). In particular, for an atom illuminated by broad- 
band radiation, this approach allows us to make a connection with Ein- 
stein’s treatment of radiation that was presented in Chapter 1; we shall 
find the Einstein B coefficient in terms of the matrix element for the 
transition. Then we can use the relation between Ao; and Bo; to cal- 
culate the spontaneous decay rate of the upper level. Finally, we shall 
study the roles of natural broadening and Doppler broadening in the ab- 
sorption of radiation by atoms, and derive some results needed in later 
chapters such as the a.c. Stark shift. 


7.1 Setting up the equations 


We start from the time-dependent Schrodinger equation? 


dw 
ih— = HV. (gs 
at fi) 
The Hamiltonian has two parts, 
H = Ho + H(t). (72) 


That part of the Hamiltonian that depends on time, Hy (t), describes 
the interaction with the oscillating electric field that perturbs the eigen- 
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The operators do not have hats so 
H = H, as previously. 
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2Note that Epo cos (wt) is not replaced 
by a complex quantity Ege i”*, be- 
cause a complex convention is built into 
quantum mechanics and we must not 
confuse one thing with another. 


3 This dipole approximation holds when 
the radiation has a wavelength greater 
than the size of the atom, i.e. A > ao, 
as discussed in Section 2.2. 


4Section 2.2 on selection rules shows 
how to treat other polarizations. 


functions of Ho; the unperturbed eigenvalues and eigenfunctions of Ho | 
are just the atomic energy levels and wavefunctions that we found in 
previous chapters. We write the wavefunction for the level with energy 
E,, as 

Wy (r,t) = Yn (rye nlm (7.3) 


For a system with only two levels, the spatial wavefunctions satisfy 
Hoy (r) = Ex (x), 
Hoye (vr) = Eo (r)- 


These atomic wavefunctions are not stationary states of the full Hamil- 
tonian, Hp + Hj (t), but the wavefunction at any instant of time can be 
expressed in terms of them as follows: 


(7.4) 


W (r,t) = cy (t) dy (rv) ee E*/™ + ep (t) He (1) eo H2t/” (7.5) 


or, in concise Dirac ket notation (shortening c; (t) to c1, w, = Ey /h, 
etc.), 
W (r,t) = c; |1)e** + & [2)e (7.6) 


Normalisation requires that the two time-dependent coefficients satisfy 


ese, (7.7) 


7.1.1 Perturbation by an oscillating electric field 


The oscillating electric field E = Ep cos (wt) of electromagnetic radiation 
produces a perturbation described by the Hamiltonian 


Hy (t) = er - Eo cos (wt) . (7.8) 


This corresponds to the energy of an electric dipole —er in the electric 
field, where r is the position of the electron with respect to the atom’s 
centre of mass.” Note that we have assumed that the electric dipole 
moment arises from a single electron but the treatment can easily be 
generalised by summing over all of the atom’s electrons. The interaction 
mixes the two states with energies EF, and E2. Substitution of eqn 7.6 
into the time-dependent Schrodinger eqn 7.1 leads to 


ic, = Qcos (wt) ee , (7.9) 


ic. = 1* cos (wt) ee, , (7.10) 
where wo = (E2 — E,) /h and the Rabi frequency 2 is defined by 


Q= oo = 5 | vt (r) x - Eo to (r) dr. (7.11) 


The electric field has almost uniform amplitude over the atomic wave- 
function so we take the amplitude |Eo| outside the integral.? Thus, for 
radiation linearly polarized along the z-axis, E = |Eo|€, cos (wt), we 
obtain* 


= eX 42|Eo| 


Q 
h ? 


(7.12) 


where 


To solve the coupled differential equations for c; (t) and c2 (t) we need 
to make further approximations. 


7.1.2 The rotating-wave approximation 


When all the population starts in the lower level, c,; (0) = 1 and cz (0) = 
0, integration of eqns 7.9 and 7.10 leads to 


ce, (t) =1, 
_ 2* §1—exp[i(wo+w)t] | 1—exp[i(wo — w)t] (7.14) 
oh ey { wT * ae Ne 


This gives a reasonable first-order approximation while c2 (t) remains 
small. For most cases of interest, the radiation has a frequency close to 
the atomic resonance at wo so the magnitude of the detuning is small, 
lwo — w| < wo, and hence wo + w ~ 2wo. Therefore we can neglect the 
term with denominator wo + w inside the curly brackets. This is the 
rotating-wave approximation.” The modulus-squared of the co-rotating 
term gives the probability of finding the atom in the upper state at time 


t as 
sin {(wo — w)t/2} |? 


b 
= |Q—_——— 15 
leo (t) ae (7.15) 
or, in terms of the variable x = (w — wo)t/2, 
1 sin? x 
ce (4)|" = 7 Ole cre (7.16) 


The sinc function (sinz) /x has a maximum at x = 0, and the first 
minimum occurs at x = 7 or Wo — w = +27/t, as illustrated in Fig. 7.1; 
the frequency spread decreases as the interaction time t¢ increases. 


|eo(#)|? 
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5 This is not true for the interaction of 
atoms with radiation at 10.6 4m from a 
COz laser. This laser radiation has a 
frequency closer to d.c. than to the res- 
onance frequency of the atoms, e.g. for 
rubidium with a resonance transition in 
the near infra-red (780 nm) we find that 
Wo © 15w, hence wo tw ~ wo—w. Thus 
the counter-rotating term must be kept. 
The quasi-electrostatic traps (QUEST) 
formed by such long wavelength laser 
beams are a form of the dipole-force 
traps described in Chapter 10. 


Fig. 7.1 The excitation probability 
function of the radiation frequency has 
a maximum at the atomic resonance. 
The line width is inversely proportional 
to the interaction time. The function 
sinc? also describes the Fraunhofer dif- 
fraction of light passing through a sin- 
gle slit—the diffraction angle decreases 
as the width of the aperture increases. 
The mathematical correspondence be- 
tween these two situations has a natural 
explanation in terms of Fourier trans- 
forms. 
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6 As in optics experiments with broad- 
band light, it is intensities that are 
summed, e.g. the formation of white- 
light fringes in the Michelson interfer- 
ometer. 


7.2 The Einstein B coefficients 


In the previous section we found the effect of an electric field Eo cos (wt) 
on the atom. To relate this to Einstein’s treatment of the interaction — 
with broadband radiation we consider what happens with radiation of 
energy density p(w) in the frequency interval w to w+dw. This produces 
an electric field of amplitude Eo (w) given by p(w) dw = eo Hj (w) /2. 
For this narrow (almost monochromatic) slice of the broad distribution 
eqn 7.12 gives 


oy? = |eX12Ho Ww) ; _ 2 [Xia]? 2 (w) dw 


. aan (7.17) 


Integration of eqn 7.15 over frequency gives the excitation probability 
for the broadband radiation as 


2 2 pwotA/2 in2 — w)t/2 
2e? |X12| i S08 sin* {(wo — w)t/ Lr 


C(t)? = 
| 2 ( )| €gh? pea (wo tee w)? 


(7.18) 


We integrate the squares of the amplitudes, rather than taking the square 
of the total amplitude, since contributions at different frequencies do 
not interfere. The range of integration A must be large compared to 
the extent of the sinc function, but this is easily fulfilled since as time 
increases this function becomes sharply peaked at wo. (In the limit 
t + co it becomes a Dirac delta function—see Loudon (2000).) Over 
the small range about wp where the sinc function has an appreciable 
value, a smooth function like p (w) varies little, so we take p (wo) outside 
the integral. A change of variable to = (w — wo) t/2, as in eqn 7.16, 
leads to 262 |X 2 Pre 

Jon (0)? > FF pln) x 5 f° 


The integration has limits of s = +¢ = tAt/4 > x and the integral ap- 
proximates closely to [°° «~? sin? x dz = 7. We make this assumption 
of a long interaction to find the steady-state excitation rate for broad- 
band radiation. The probability of transition from level 1 to 2 increases 
linearly with time corresponding to a transition rate of 


da. (7.19) 


= \co (t)|? me? |Xy2/? 


= ——— p (wo). (7.20) 


R 
ve t egh? 


Comparison with the upward rate B)2p(w) in Einstein’s treatment of 
radiation (eqn 1.25) shows that 
me? frele 


BS = 
a 3€oh? ; 


(7.21) 
where [Xaaf7 > |D12|? /3, and Dj» is the magnitude of the vector 


Dio = {1| r |2) = [ure d’r. ; (7.22) 
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The factor of 1/3 arises from averaging of D - €,aq, where €;aq is a unit 
vector along the electric field, over all the possible spatial orientations 
of the atom (see Exercise 7.6). The relation between Ag; and Bj2 in 
eqn 1.32 leads to - 

a 
n= ere x w? |[D12|? 5 (720) 
where a: = e?/ (4reghc) is the fine-structure constant. The matrix ele- 
ment between the initial and final states in eqn 7.22 depends on an inte- 
gral involving the electronic wavefunctions of the atom so, as emphasised 
previously, the Einstein coefficients are properties of the atom. For a typ- 
ical allowed transition the matrix element has an approximate value of 
Dj2 ~ 3a (this can be calculated analytically for hydrogenic systems). 
Using this estimate of D2 in the equation gives Aj, ~ 2m x 10%s7! for 
a transition of wavelength \ = 6 x 107’ m and g; = gz = 1. Although 
we have not given a physical explanation of spontaneous emission, Ein- 
stein’s argument allows us to calculate its rate; he obtained the relation 
between A»; and Bo; and we have used TDPT to determine Bo; from the 
atomic wavefunctions. For a two-level atom with an allowed transition 
between the levels the quantum mechanical result corresponds closely 
to the treatment of radiative decay using classical electromagnetism in 
Section 1.6. 


7.3. Interaction with monochromatic 
radiation 


The derivation of eqns 7.14 assumed that the monochromatic radiation 
perturbed the atom only weakly so that most of the population stayed 
in the initial state. We shall now find a solution without assuming a 
weak field. We write eqn 7.9 as 

ie. ee eae)! - | (7.24) 
and similarly for eqn 7.10. The term with (w + wp) t oscillates very fast 
and therefore averages to zero over any reasonable interaction time—this 
is the rotating-wave approximation (Section 7.1.2) and it leads to 


ic) == enelle—wolte ; 
, (7.25) 
id. = cye Ne ae 
2 
These combine to give 
d7e, deo n 
ae +i(w—wo) Ge t+ oy oy =O). (7.26) 


The solution of this second-order differential equation for the initial con- 
ditions c, (0) = 1 and c. (0) = 0 gives the probability of being in the yo, transitions between bound states 
upper state as’ the frequency 0 is real, so |Q|? = ?. 
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8 This is partly a consequence of the 
dependence on w® in eqn 7.23, but 
also because the magnetic dipole transi- 
tions have smaller matrix elements than 
electric dipole transitions. For electric 
dipole transitions in the optical region 
spontaneous emission washes out the 
Rabi oscillations on a time-scale of tens 
of nanoseconds (tr = 1/Agi, assuming 
that the predominant decay is from 2 
to 1, and we estimated Aj2 above). 
Nevertheless, experimenters have ob- 
served coherent oscillations by driving 
the transition with intense laser radi- 
ation to give a high Rabi frequency 
(ir SS 1D). 


°This can be shown by solving 
eqns 7.25 (and 7.26) for w = wo. 


ere eA 
|c2 (t)|? = Wa im (=) > (7.20) 
where 
Wao oe. (7.28) 
At resonance w = wo and W = 2, so 
|co (#)|? = sin? (F) , (7.29) 


The population oscillates between the two levels. When Nt = 7 all the 
population has gone from level 1 into the upper state, |c2 (t)|? = lian 
when Nt = 27 the atom has returned to the lower state. This behaviour 
is completely different from that of a two-level system governed by rate 
equations where the populations tend to become equal as the excitation 
rate increases and population inversion cannot occur. These Rabi os- 
cillations between the two levels are readily observed in radio-frequency 
spectroscopy, e.g. for transitions between Zeeman or hyperfine states. 
Radio-frequency and microwave transitions have negligible spontaneous 
emission so that, in most cases, the atoms evolve coherently.® 


7.3.1. The concepts of 7-pulses and 7/2-pulses 


A pulse of resonant radiation that has a duration of t;, = 7/Q is called 
a m-pulse and from eqn 7.29 we see that Nt = 7 results in the complete 
transfer of population from one state to the other, e.g. an atom initially 
in |1) ends up in |2) after the pulse. This contrasts with illumination by 
broadband radiation where the populations (per state) become equal as 
the energy density p(w) increases. More precisely, a 7-pulse swaps the 
states in a superposition:® 


ci |1) + c2 |2) > —i {e, |2) + ee |1)}.- (7.30) 


This swap operation is sometimes also expressed as |1) © |2), but the 
factor of —i is important in atom interferometry, as shown in Exer- 
cise 7.3. 

Interferometry experiments also use 7/2-pulses that have half the du- 
ration of a 7-pulse (for the same Rabi frequency (2). For an atom initially 
in state |1), the 7/2-pulse puts its wavefunction into a superposition of 
states |1) and |2) with equal amplitudes (see Exercise 7.3). 


7.3.2 The Bloch vector and Bloch sphere 


In this section we find the electric dipole moment induced on a atom by 
radiation, and introduce a very powerful way of describing the behaviour 
of two-level systems by the Bloch vector. We assume that the electric 
field is along €,, as in eqn 7.12. The component of the dipole along this 
direction is given by the expectation value 


ee / wt(t) ex U(t) dr. (7.31) 
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Using eqn 7.5 for W (t) gives this dipole moment of the atom as 


D,(t) ce [lacs a cge7 2h yo)” ie (cre#2* apy re ene #2* apo) d°r 


Gee) 


= 630, Xo," SF CiCo Xj, ot - 


Here wo = We — w;. The dipole moment is a real quantity since from 
eqn 7.13 we see that X21 = (X12)*, and also X11 = X22 = 0. To calculate 
this dipole moment induced by the applied field we need to know the 
bilinear quantities cfc2 and c}c,. These are some of the elements of the 


density matrix!® 
= te P12 
p21 p22) 


my wl=(S) (et 4) = 


Off-diagonal elements of the density matrix are called coherences and 
they represent the response of the system at the driving frequency (eqn 
7.32). The diagonal elements |c|” and |c2|” are the populations. We 
define the new variables 


ler? cre} 


(7.33) 


C2CF |c2|? 


& = ce %#/2 | (7.34) 
C2 = eneot/? 5 (7.35) 


where 6 = w — Wo is the detuning of the radiation from the atomic 
resonance. This transformation does not affect the populations ( pi; = 
fii and p22 = p22) but the coherences become pi2 = fi2 exp(—idt) and 
p21 = poi exp(idt) = (pi2)*. In terms of these coherences the dipole 
moment is 


I! 


—eD, (t) —eX 192 { pize'”? ae pue i} = —eX19 {pi2e* af pue “'} 


(7.36) 


—eX 42 (ucoswt — vsinwt). 


The coherences pj2 and p21 give the response of the atom at w, the 
(angular) frequency of the applied field. The real and imaginary parts 
of p12 (multiplied by 2) are: 


= Pi2 + pri, 


i ee (7.37) 
0 = —i (Pio — fa) - 


In eqn 7.36 we see that u and v are the in-phase and quadrature com- 
ponents of the dipole in a frame rotating at w. To find expressions for 
Piz, P21 and poz, and hence u and v, we start by writing eqns 7.25 for 
¢, and cp in terms of 6 as follows:!? 


ig = eee ‘ (7.38) 
Q 
1€23 = Cie ie ° (7.39) 


Differentiation of eqn 7.34 yields'? 


10-This is the outer product of |W) and 
its Hermitian conjugate (U| = |W)', the 
transposed conjugate of the matrix rep- 
resenting |W). This way of writing the 
information about the two levels is an 
extremely useful formalism for treating 
quantum systems. However, we have no 
need to digress into the theory of den- 
sity matrices here and we simply adopt 
it as a convenient notation. 


1lWe assume that X 12 is real. This is 
true for transitions between two bound 
states of the atom—the radial wave- 
functions are real and the discussion 
of selection rules shows that the in- 
tegration over the angular momentum 
eigenfunctions also gives a real contri- 
bution to the matrix element—the in- 
tegral over @ is zero unless the terms 
containing powers of exp(—id) cancel. 


12 all the steps in this lengthy proce- 
dure cannot be written down here but 
enough information is given for metic- 
ulous readers to fill in the gaps. 


13 spontaneous decay is ignored here— 
this section deals only with coherent 
evolution of the states. 
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14This is appropriate for calculating 
absorption. Alternatively, p22 — p11 
could be chosen as a variable—this pop- 
ulation inversion determines the gain in 
lasers. 


a = Ge 5t/2 oo Peel? : (7.40) 
Multiplication by i and the use of eqns 7.38, 7.34 and 7.35 yields an 
equation for ¢ (and similarly we can obtain ¢: from eqn 7.39): 


icq.) = = (6 C1 +02), 
a, (7.41) 
id = 5 (9% — 5%). 


: a eS 
From these we find that the time derivatives p,. = CiCz + €1C2, etc. are 


dp. dpa \* oc «| (ih 
a _ ( 7) = —i6 pi2 + — (p11 — p22), 


dp22 dpi ue _~ ) 
di de ee 
The last equation is consistent with normalisation in eqn 7.7, i.e. 
po2 + pir = 1. (7.43) 


In terms of u and v in eqns 7.37 these equations become 


wa 0 vs 
v=—du+Q (pir i p22) 5) (7.44) 
= 
P22 = 2° 


We can write the population difference p11 — p22 as** 


W = P11 — p22, (7.45) 


so that finally we get the following compact set of equations: 


i— Ov. 
v=—du+Qw, (7.46) 
w= —-—Ov. 


These eqns 7.46 can be written in vector notation as: 
R=Rx (M@, +663) =Rx W, (7.47) 
by taking u, v and w as the components of the Bloch vector 
R=ue; + veo + we, (7.48) 
and defining the vector 
W =e, + 6e3 (7.49) 


that has magnitude W = /? + 6? (cf. eqn 7.28). The cross-product of 
the two vectors in eqn 7.47 is orthogonal to both R and W. This implies 
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that R-R =0 so |R|’ is constant—it is straightforward! to show that 15|R| = 1 for the state u=v =0, w = 
this constant is unity so that |R|? = |u|? + |u|? +|w|? =1. The Bloch 1 and it always remains unity. This can 
vector corresponds to the position vector of points on the surface of a cea Be Ae by 2 ong te 
sphere with unit radius; this Bloch sphere is shown in Fig. 7.2. See gta 

It also follows from eqn 7.47 that R- W = 0 and so R- W = RW cos@ 
is constant. For excitation with a fixed Rabi frequency and detuning 
the magnitude W is constant, and since R is also fixed the Bloch vector 
moves around a cone with @ constant, as illustrated in Fig. 7.2(d). In 
this case p22 varies as in eqn 7.27 and 


2? it 
w= 1 - 2p22 Se (=). 


This motion of the Bloch vector for the state of the atom resembles that 164. gescribed in Blundell (2001, Ap- 
of a magnetic moment in a magnetic field,'® e.g. for adiabatic motion the _ pendix G). 


(a) |1) (b) |!) 


fy) ~ (2) > 1) + 12) 


(d) & 


Fig. 7.2 The Bloch sphere. The position vectors of points on its surface represent the states of a two-level system (in 
Hilbert space). Examples of states are shown in (a) and (b). At the poles of the sphere the Bloch vector is R = wé3, with 
w = +1 corresponding to the states |1) and |2), respectively. States that lie on the equator of the Bloch sphere have the 
form R = we; + vee, e.g. the states for which R = vé2 with u = 0 and v = +1 are shown in (b) and these correspond 
to (\1) +i|2))/V2, respectively (normalisation constants are not given in the figure for clarity). These examples illustrate an 
interesting property of this representation of quantum states, namely that diametrically-opposite states on the Bloch sphere 
are orthogonal. (c) The evolution of the Bloch vector for a system driven by a resonant field, i.e. 6 = 0 so that W = (2; in 
eqn 7.49. The evolution follows a great circle from |1) at the north pole to |2) at the south pole and back again, as described 
in Example 7.1. The Bloch vector remains perpendicular to W. (d) When 6 # 0 the Bloch vector also has a fixed angle with 
respect to W, since R- W = RW cos@ is constant, but @ is not equal to 7/2. (This quantum mechanical description of the 
two-level atom is equivalent to that for a spin-1/2 system.) 
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1Ttn this particular example the fi- 
nal state is obvious by inspection, but 
clearly the same principles apply to 
other initial states, e.g. states of the 
form {{1) + e!? |2)} /./2 that lie on the 
equator of the sphere. The Bloch 
sphere is indispensable for thinking 
about more complex pulse sequences, 
such as those used in nuclear magnetic 
resonance (NMR). 


18This 
Je2|? <& Ile 


assumes weak excitation: 


19-Dhis is not the FWHM but it is close 
enough for our purposes. 


20 his expression is equivalent to 
eqn 6.40 that was used to calculate the 
line width for an atomic clock. 


energy —-B is constant, and the magnetic moment precesses around 
the direction of the field B = Be,. In the Bloch description the fictitious 
magnetic field lies along W and the magnitude W in eqn 7.28 determines 
the precession rate. 


Example 7.1 Resonant excitation (6 = 0) gives W = Qe; and R 
describes a cone about e;. An important case is when all the population 
starts in level 1 so that initially R-e, = 0; in this case the Bloch vector 
rotates in the plane perpendicular to €; mapping out a great circle on 
the Bloch sphere, as drawn in Fig. 7.2(c). This motion corresponds to 
the Rabi oscillations (eqn 7.29). In this picture a 7/2-pulse rotates the 
Bloch vector through 7/2 about €,. A sequence of two 7/2-pulses gives 
a 7-pulse that rotates the Bloch vector (clockwise) through 7 about 
€,, e.g. w = 1 > w = —1 and this represents the transfer of all the 
population from level 1 to 2.1” This is consistent with the more general 
statement given in eqn 7.30. 


The very brief introduction to the Bloch sphere given in this section 
shows clearly that a two-level atom’s response to radiation does not 
increase indefinitely with the driving field—beyond a certain point an 
increase in the applied field (or the interaction time) does not produce 
a larger dipole moment or change in population. This ‘saturation’ has 
important consequences and makes the two-level system different from a 
classical oscillator (where the dipole moment is proportional to the field, 
as will be shown in Section 7.5). 


7.4 Ramsey fringes 


The previous sections in this chapter have shown how to calculate the 
response of a two-level atom to radiation. In this section we shall apply 
this theory to radio-frequency spectroscopy, e.g. the method of magnetic 
resonance in an atomic beam described in Chapter 6. However, the same 
principles are important whenever line width is limited by the finite 
interaction time, both within atomic physics and more generally. In 
particular, we shall calculate what happens to an atom subjected to two 
pulses of radiation since such a double-pulse sequence has favourable 
properties for precision measurements. 

An atom that interacts with a square pulse of radiation, i.e. an os- 
cillating electric field of constant amplitude from time t = 0 to 7p, and 
Eo = 0 otherwise, has a probability of excitation as in eqn 7.15.18 This 
excitation probability is plotted in Fig. 7.1 as a function of the radia- 
tion’s frequency detuning from the (angular) resonance frequency wo. As 
stated below eqn 7.16, the frequency spread given by the first minimum 
of the sinc? function corresponds to a width!® 

Ap. 
27 


Tp 


(7.50) 


The frequency spread is inversely proportional to the interaction time,?° 


as expected from the Fourier transform relationship of the frequency and 
time domains. 

We shall now consider what happens when an atom interacts with two 
separate pulses of radiation, from time t = 0 to 7, and again from t = T 
to T+ 7p. Integration of eqn 7.10 with the initial condition c. = 0 at 
t = 0 yields 


a (t)= = { tS es w) To 


(7.51) 


+ expfi(wo — yr} See = ; 


Wo -— WwW 


This is the amplitude excited to the upper level after both pulses (t > 
T +7,). The first term in this expression is the amplitude arising from 
the first pulse and it equals the part of eqn 7.14 that remains after 
making the rotating-wave approximation.”! Within this approximation, 
interaction with the second pulse produces a similar term multiplied by 
a phase factor of exp[i(wo —w)T]. Either of the pulses acting alone would 
affect the system in the same way, i.e. the same excitation probability 
|c2|? as in eqn 7.15. When there are two pulses the amplitudes in the 
excited state interfere giving 

2 


leo |? = pte x |1 + expfi(wo — w)T])? 
ae 2 
= 2 ewe ee (S) . (7.52) 


where 6 = w — Wp is the frequency detuning. The double-pulse sequence 
produces a signal of the form shown in Fig. 7.3. These are called Ramsey 
fringes after Norman Ramsey and they have a very close similarity to the 
interference fringes seen in a Young’s double-slit experiment in optics— 
Fraunhofer diffraction of light with wavevector k from two slits of width 
a and separation d leads to an intensity distribution as a function of 
angle @ given by?” 

I = Ip cos?” €E kdsin 0) sinc” (5 kasin a) (7.53) 
The overall envelope proportional to sinc? comes from single-slit diffrac- 
tion. The cos? function determines the width of the central peak in both 
eqns 7.53 and 7.52.7" 

For the atom excited by two pulses of radiation the excitation drops 
from the maximum value at w = wo to zero when 6T/2 = 7/2 (or to 
half the maximum at 7/4); so the central peak has a width (FWHM) of 
Aw = 7/T, or equivalently 

1 
Ape vie 
This shows that Ramsey fringes from two interactions separated by time 
T have half the width of the signal from a single long interaction of 


(7.54) 
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au Neglecting terms with wo + w in the 
denominator. 


22 See Section 11.1 and Brooker (2003). 


237 both quantum mechanics and op- 
tics, the amplitudes of waves inter- 
fere constructively, or destructively, de- 
pending on their relative phase. Also, 
the calculation of Fraunhofer diffrac- 
tion as a Fourier transform of ampli- 
tude in the plane of the object closely 
parallels the Fourier transform relation- 
ship between pulses in the time domain 
and the frequency response of the sys- 
tem. 
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Fig. 7.3 Ramsey fringes from an 
atomic fountain of caesium, showing 
the transition probability for the F = 3, 
Mig == (0) Woy 0! = AL hen == O ineine 
sition versus the frequency of the mi- 
crowave radiation in the interaction re- 
gion. The height of the fountain is 
3lcm, giving a fringe width just be- 
low 1 Hz (ie. Af = 1/(2T) = 0.98 Hz, 
see text)—the envelope of the fringes 
has a more complicated shape than that 
derived in the text, but this has lit- 
tle influence since, during operation as 
a frequency standard, the microwaves 
have a frequency very close to the cen- 
tre which, by definition, corresponds to 
9192631 770Hz. This is real experi- 
mental data but the noise is not visi- 
ble because the signal-to-noise ratio is 
about 1000 (near the centre), and with 
such an extremely high-quality signal 
the short-term stability of a microwave 
source referenced to the caesium tran- 
sition is about 1 x 10~!% for 1s of av- 
eraging. Courtesy of Dale Henderson, 
Krzysztof Szymaniec and Chalupczak 
Witold, National Physical Laboratory, 
‘Teddington, UK. 


24 Ramsey originally introduced two 
separated interactions in an atomic- 
beam experiment to avoid the line 
broadening by inhomogeneous mag- 
netic fields. A small phase differ- 
ence between the two interaction re- 
gions just causes a phase shift of the 
fringes, whereas if the atom interacts 
with the radiation throughout a region 
where the field varies then the contri- 
butions from each part of the interac- 
tion region do not add in phase. The 
phasor description, commonly used in 
optics, gives a good way of thinking 
about this— Young’s fringes have a high 
contrast when the two slits of separa- 
tion d are illuminated coherently, but to 
achieve the diffraction limit from a sin- 
gle wide slit of width d requires a good 
wavefront across the whole aperture. 


251m the Bloch sphere description this 
corresponds to the following path: the 
initial 7/2-pulse causes €3 — €2, then 
the accumulated phase causes the state 
vector to move around the equator of 
the sphere to —€2, from whence the fi- 
nal 7/2-pulse takes the system back up 
to €3 (see Fig. 7.2). This formalism al- 
lows quantitative calculation of the fi- 
nal state for any type of pulse. 


Transition probability 
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duration T (cf. eqn 7.50); also, it is often preferable to have two sepa- 
rated interaction regions, e.g. for measurements in an atomic fountain 
as described in Chapter 8.74 

In practice, microwave experiments use strong rather than weak exci- 
tation (as assumed above) to obtain the maximum signal, i.e. |c2|” ~ 1. 
This does not change the width of the Ramsey fringes, as shown by 
considering two 7/2-pulses separated by time 7’. Jf no phase shift ac- 
cumulates between the two pulses they add together to act as a 7-pulse 
that transfers all the population to the upper state—from the north to 
the south pole of the Bloch sphere, as shown in Fig. 7.2(c). But if a 
relative phase shift of 7 accrues during the time interval T then there 
is destructive interference between the amplitudes in the upper state 
produced by the two pulses.”° Thus the first minimum from the central 
fringe occurs for 6 T = 7, which is the condition that gave eqn 7.54, and 
so that equation remains accurate. 


7.5 Radiative damping 


This section shows how damping affects the coherent evolution of the 
Bloch vector described in the previous section. It is shown by analogy 
with the description of a classical dipole that a damping term should be 
introduced into eqns 7.46. Ultimately, such an argument by analogy is 
only a justification that the equations have an appropriate form rather 


than a derivation, but this approach does give useful physical insight. 


7.5.1 The damping of a classical dipole 


Damping by spontaneous emission can be introduced into the quantum 
treatment of the two-level atom, in a physically reasonable way, by com- 
parison with the damping of a classical system. To do this we first 
review the damped harmonic oscillator and express the classical equa- 
tions in a suitable form. For a harmonic oscillator of natural frequency 
wo, Newton’s second law leads to the equation of motion 


F(t) 


E+ Brt+upr = = coswt. (7.55) 
The driving force has amplitude F(t) that varies slowly compared to the 
oscillation at the driving frequency. (The friction force is Friction = —aZ 
and 2 = a/m, where m is the mass.) To solve this we look for a solution 
of the form 


x =U (t)coswt — Y(t)sinwt. (7.56) 


This anticipates that most of the time dependence of the solution is an 
oscillation at frequency w, and U is the component of the displacement 
in-phase with the force, and the quadrature component V has a phase 
lead?® of 1/2 with respect to F coswt.?” Substitution of eqn 7.56 into 
eqn 7.55 and equating terms that depend on sinwt and coswt gives 


U = (w=) V— By. 
2 (7.57) 


respectively. The amplitudes // and V change in time as the amplitude 
of the force changes, but we assume that these changes occur slowly 
compared to the fast oscillation at w. This slowly-varying envelope ap- 
proximation has been used in the derivation of eqn 7.57, i.e. U and V have 
been neglected and VY <wyV (see Allen and Eberly 1975). By setting 
U = VY = 0 we find the form of the solution that is a good approxima- 
tion when the amplitudes and the force change slowly compared to the 
damping time of the system 1//: 


(Cok) = Be) F 
8 oman)? + (BPR? Fmd’ a 
v= meld Eis (7.59) 


(w — wo)? + (6/2)? 2mw 


The approximation w? — w2 = (w+wo)(w—WwWo) & 2w(w—wo) has 
been used so these expressions are only valid close to resonance*®—they 
give the wrong result for w ~ 0. The phase is found from tan@ = V/U 
(see Exercise 7.7). The phase lies in the range ¢ = 0 to —7 for a force 


of constant amplitude.?9 
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26.4 phase lead occurs when V(t) > 0, 
since —sinwt = cos(wt + 7/2); but of- 
ten V(t) < 0 and this corresponds to a 
phase lag. 


27- This method of considering the com- 
ponents U and Y is equivalent to the 
phasor description which is widely used 
in the theory of a.c. circuits (made from 
capacitors, inductors and resistors) to 
represent the phase lag, or lead, be- 
tween the current and an applied volt- 
age of the form Vo cos wt. 


28 This is a very good approximation 
for optical transitions since typically 
B/wo ~ 10~®. The assumption of small 
damping is implicit in these equations 
and therefore the resonance frequency 
is very close to wo. 


297¢ is well known from the study of 
the .harmonic oscillator with damping 
that the mechanical response lags be- 
hind the driving. At low frequencies 
the system closely follows the driving 
force, but above the resonance, where 
Ww > wo, the phase shift lies in the range 
—t/2<p<—T. 


136 The interaction of atoms with radiation 


30 For light damping (8/wo < 1) the 
decaying oscillations have angular fre- 
quency w! = {w2 — p2/ayi? ~ wo. 


3lWe use the same slowly varying en- 
velope approximation as for eqn 7.57, 


namely V < wY, etc. 


32 This is normally a positive quantity 
since r < 0 (see eqn 7.59). 


33 See Fig. 9.12. 


The sum of the kinetic energy 4 ma and the potential energy 5 mur? 


gives the total energy E = mu" (U? a vy?) using the approximation 
we ~ w?. This changes at the rate E = mw*(UU + VY), and hence from 
eqns 7.57 for U and Y we find that 


f=—ph= FV5 (7.60) 


For no driving force (F = 0) the energy decays away. This is consistent 
with the complementary function of eqn 7.55 (the solution for F = 0) 
that gives the oscillator’s transient response as*° 


x = ape °*/? cos (w't +). (7.61) 


Energy is proportional to the amplitude of the motion squared, hence 
the exp (—Gt/2) dependence in eqn 7.61 becomes E «x exp(—/t). The 
term F'Vw/2 in eqn 7.60 is the rate at which the driving force does work 
on the oscillator; this can be seen from the following expression for power 
as the force times the velocity: 


P = F (t)cos(wt) z. (7.62) 


The overlining indicates an average over many periods of the oscillation 
at w, but the amplitude of the force F(t) may vary (slowly) on a longer 
time-scale. Differentiation of eqn 7.56 gives the velocity as?! 


g 2 —Uwsinwt — Vwcoswt , (7.63) 
and only the cosine term contributes to the cycle-averaged power:?? 
P=-F(t) Vs (7.64) 


This shows that absorption of energy arises from the quadrature com- 
ponent of the response V. 

In the classical model of an atom as an electron that undergoes simple 
harmonic motion the oscillating electric field of the incident radiation 
produces a force F(t) = —e|Eo| cos wt on the electron. Each atom in the 
sample has an electric dipole moment of D = —ez (along the direction 
of the applied field). The quadrature component of the dipole that 
gives absorption has a Lorentzian function of frequency as in eqn 7.59. 
The in-phase component of the dipole that determines the polarization 
of the medium and its refractive index (Fox 2001) has the frequency 
dependence given in eqn 7.58.°% 

When any changes in the driving force occur slowly eqn 7.60 has the 
following quasi-steady-state solution: 


_ |FV|w 

= ee 
This shows that the energy of the classical oscillator increases linearly 
with the strength of the driving force, whereas in a two-level system the 


energy has an upper limit when all the atoms have been excited to the 
upper level. 


E 


(7.65) 


7.5.2 The optical Bloch equations 


A two-level atom has an energy proportional to the excited-state pop- 
ulation, E = po2hwo. By analogy with eqn 7.60 for the energy of a 
classical oscillator, we introduce a damping term into eqn 7.44 to give 


, 19) 

Pog = —Vpo2 + me 
In the absence of the driving term (Q = 0) this gives exponential decay of 
the population in level 2, i.e. p22(t) = p22(0) exp (—It). In this analogy, 
between the quantum system and a classical oscillator, corresponds to 
@. From eqns 7.57 we see that the coherences u and v have a damping 
factor of ['/2 and eqns 7.46 become the optical Bloch equations*4 


(7.66) 


du— su, 


—u+ Mw ~ 50, 
w= —-Ov -—T(w—-1). 


u 


p (7.67) 


For 2 = 0 the population difference w > 1. These optical Bloch equa- 
tions describe the excitation of a two-level atom by radiation close to 
resonance for a transition that decays by spontaneous emission. There 
is not room here to explore all of the features of these equations and 
their many diverse and interesting applications; we shall concentrate 
on the steady-state solution that is established at times which are long 
compared to the lifetime of the upper level (t >> I'~!), namely*® 


u 1 06 
==> OT 2 7.68 
UV 62+ 0?/24 12/4 Puaegy ( ) 


These show that a strong driving field (Q — oo) tends to equalise the 
populations, i.e. w + 0. Equivalently, the upper level has a steady-state 
population of 

l—-w _ 07/4 

2 624 0?/24+12/4’ 
and p22 — 1/2 as the intensity increases. This key result is used in 
Chapter 9 on radiation forces. 

In the above, the optical Bloch equations have been justified by anal- 
ogy with a damped classical oscillator but they also closely resemble the 
Bloch equations that describe the behaviour of a spin-1/2 particle in a 
combination of static and oscillating magnetic fields.°* The reader fa- 
miliar with magnetic resonance techniques may find it useful to make an 
analogy with that historically important case.°’ For times much shorter 
than any damping or relaxation time, the two-level atom and spin-1/2 
system behave in the same way, i.e. they have a coherent evolution such 
as m- and 7/2-pulses, etc. A steady-state solution of the optical Bloch 
equations has been presented (and nothing has been said about the dif- 
ferent result for a spin-1/2 system). 


Peo = (7.69) 
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34The main purpose of the rather 
lengthy discussion of the classical case 
was to highlight the correspondence be- 
tween u, v and U, V to make this step 
seem reasonable. An auxiliary feature 
of this approach is to remind the reader 
of the classical electron oscillator model 
of absorption and dispersion (which is 
important in atomic physics). 


35The steady-state solution is obtained 
by setting uw =v = w = 0 in eqns 7.67 
to give three simultaneous equations. 


36The Zeeman effect leads to a split- 
ting between states with ms; = +1/2 
to give a two-level system, and the os- 
cillating magnetic field drives magnetic 
dipole transitions between the levels. In 
atomic physics such transitions occur 
between Zeeman states and hyperfine 
levels (Chapter 6). 


37 The Bloch equations were well known 
from magnetic resonance techniques 
before lasers allowed the observation 
of coherent phenomena in optical tran- 
sitions. Radio-frequency transitions 
have negligible spontaneous emission 
and the magnetic dipole of the whole 
sample decays by other mechanisms. 
Where the optical Bloch equations 
(eqns 7.67) have decay constants of [ 
and ['/2 for the population and co- 
herences, respectively, the Bloch equa- 
tions have 1/T, and 1/T2. The de- 
cay rates 1/7; and 1/T> in magnetic 
resonance techniques are expressed in 
terms of 7] and To, the longitudinal 
and transverse relaxation times, respec- 
tively. Under some conditions the two 
relaxation times are similar, but in 
other cases Ty < J). Ty describes 
the relaxation of the component of the 
magnetic moment parallel to the ap- 
plied field B which requires exchange 
of energy (e.g. with the phonons in a 
solid). T2 arises from the dephasing of 
individual magnetic moments (spins) so 
that the magnetisation of the sample 
perpendicular to B decays. For further 
details see condensed matter texts, e.g. 
Kittel (2004). 
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33 7 represents the number density and 
has dimensions of m~*, following the 
usual convention in laser physics. 


Fig. 7.4 Atoms with number density 
N distributed in a slab of thickness Az 
absorb a fraction NoAz of the inci- 
dent beam intensity, where o is defined 
as the cross-section (for absorption). 
NAz is the number of atoms per unit 
area and o represents the ‘target’ area 
that each atom presents. We assume 
that the motion of the target atoms 
can be ignored (Doppler broadening is 
treated in Exercise 7.9) and also that 
atoms in the next layer (of thickness 
6z) cannot ‘hide’ behind these atoms 
(see Brooker 2003, Problem 3.26). 


7.6 The optical absorption cross-section 


Monochromatic radiation causes an atom to undergo Rabi oscillations, 
but when the transition has damping the atom settles down to a steady 
state in which the excitation rate equals the decay rate. This has been 
shown explicitly above for an optical transition with spontaneous emis- 
sion, but the same reduction of the coherent evolution of quantum am- 
plitudes to a simple rate equation for populations (amplitudes squared) 
also occurs for other line-broadening mechanisms, e.g. Doppler broad- 
ening (Chapter 8) and collisions. Thus the equilibrium situation for 
monochromatic radiation is described by rate equations like those in 
Einstein’s treatment of excitation by broadband radiation (eqns 1.25). 
It is convenient to write these rate equations in terms of an optical ab- 
sorption cross-section defined in the usual way, as in Fig. 7.4. Consider a 
beam of particles (in this case photons) passing through a medium with 
N atoms per unit volume.*® A slab of thickness Az has N Az atoms per 
unit area and the fraction of particles absorbed by the target atoms is 
NoAz, where a is defined as the cross-section; Nao Az gives the fraction 
of the target area covered by the atoms and this equals the probabil- 
ity that an incident particle hits an atom in the target (as it passes 
through the slab). The parameter o that characterises the probability 
of absorption is equally well definable in quantum mechanics (in which 
photons and particles are delocalised, fuzzy objects) even though this 
cross-section generally has little relation to the physical size of the ob- 
ject (as we shall see). The probability of absorption equals the fraction 
of intensity lost, AJ/I = —NoAz, so the attenuation of the beam is 
described by 

gH. 

dz ~ 
where k (w) is the absorption coefficient at the angular frequency w of 
the incident photons. Integration gives an exponential decrease of the 
intensity with distance, namely 


—k(w) I = —-No(w)I, (7.70) 


I (w,z) = I(w,0) exp {—K (w) z}. eral) 


N atoms per unit volume 


Area, A 


Incident 
light 


Cross-section 
o per atom 


> -— 
Thickness, Az 
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This formula, known as Beer’s law (see Fox 2001), works well for ab- 
sorption of low-intensity light that leaves most of the population in the 
ground state. Intense laser light significantly affects the populations of 
the atomic levels and we must take this into account. Atoms in level 2 
undergo stimulated emission and this process leads to a gain in intensity 
(amplification) that offsets some of the absorption. Equation 7.70 must 
be modified to*® 


dl 

dz 
Absorption and stimulated emission have the same cross-section. For 
the specific case of a two-level atom this can be seen from the symmetry 
with respect to the exchange of the labels 1 and 2 in the treatment 
of the two-level atom in the early parts of this chapter; the oscillating 
electric field drives the transition from 1 to 2 at the same rate as the 
reverse process—only the spontaneous emission goes one way. This is an 
example of the general principle that a strong absorber is also a strong 
emitter.?° This is also linked to the equality of the Einstein coefficients, 
Bj. = Bo, for non-degenerate levels. The population densities in the 
two levels obey the conservation equation N = N, + No.*! In the steady 
state conservation of energy per unit volume of the absorber requires 
that 


—K (w) I (w) = —(M, — No)o (w) I (w). (ane) 


(Ni = N2)o (w) if (w) = Nz Aoi hw. (7.73) 


On the left-hand side is the amount by which the rate of absorption 
of energy exceeds the stimulated emission, i.e. the net rate of energy 
absorbed per unit volume. On the right-hand side is the rate at which the 
atoms scatter energy out of the beam—the rate of spontaneous emission 
for atoms in the excited state times hw.** The number densities are 
related to the variables in the optical Bloch equations by po2 = N2/N 
and 


No = Ni 
ee .74 
—, (7.74) 
and w and p22 are given in eqns 7.68 and 7.69, respectively. Hence 
Agi hw 07/4 Agi hw 
i / pple lel (7.75) 


wT a Su Bed i 


Both J and 9? are proportional to |Eo|” so this cancels out, and further 
manipulation yields*® 
nc? 
oO (w) = 3) ae Az oH (w) - (7.16) 
0 
The Lorentzian frequency dependence is expressed by the line shape func- 


tion 1 r 


oo fe ers 


where the subscript H denotes homogeneous, i.e. something that is the 
same for each atom, like the radiative broadening considered here.** The 


gu (w) (7.77) 


39We do not try to include the degen- 
eracy of the levels because illumination 
with intense polarized laser light usu- 
ally leads to unequal populations of the 
states with different M7, or Mp. This 
differs from the usual situation in laser 
physics where the excitation, or pump- 
ing mechanisms, populate all states in a 
given level at the same rate, so Ni/g1 
and N2/g2 can be taken as the pop- 
ulation densities per state. Selective 
excitation of the upper level can give 
Ne2/g2 > Ni/gi, and hence gain. 


40The laws of thermodynamics require 
that an object stays in equilibrium with 
black-body radiation at the same tem- 
perature, hence the absorbed and emit- 
ted powers must balance. 


a Compare this with eqns 1.26, 7.7 and 
7.43. 


42-This assumes that atoms do not get 
rid of their energy in any other way 
such as inelastic collisions. 


43 Intensity is related to the electric 
field amplitude by I = eoc|Eo (w)|” /2, 
and 02 = «7X2, [Bol /h? (eqn 7), 
Also X?, = |Di2|? /3 « Aoi (eqn 7.23). 
The degeneracy factors are gi = g2 = 1 
for the two-level atom, but see eqn 7.79. 


441t is a general result that homoge- 
neous broadening mechanisms give a 
Lorentzian line shape. 
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45 spontaneously emitted photons go in 
random directions so an average over 
angles always occurs in the calculation 
of Aa1 : 


46 Typically, for atoms with a well- 
defined orientation the polarization of 
the light is chosen to give the maximum 
cross-section. If this is not the case then 
the angular momentum algebra may be 
required to calculate the matrix ele- 
ments. Only in special cases would the 
polarization be chosen to give a very 
weak interaction, i.e. a pre-factor much 
less than unity in eqn 7.76. 


area under this line shape function equals unity: 


/ gu (w)dw=1. 


== Os 


(7.78) 


The pre-factor of 3 in eqn 7.76 may have any value in the range 0 to 3. 
It has the maximum value of 3 for atoms with the optimum orientation 
to absorb a beam of polarized laser light (from a specific direction). 
However, if either the light is unpolarized or the atoms have a random 
orientation (i.e. they are uniformly distributed across all the Mj states 
or Mp states) then the pre-factor is 1 because |Xi2|” = |Di2|* /3 as 
in eqn 7.21 (from the average of cos? 6 over all angles), and this 1/3 
cancels the pre-factor of 3.4° Under these conditions the absorption does 
not depend on the magnetic state (My or Mf) so a real atom with 
degenerate levels has a cross-section of 


(7.79) 


This equation, or eqn 7.76, applies to many experimental situations. 
Careful study of the following examples gives physical insight that can 
be applied to other situations.*® 


Example 7.2 Atoms in a specific Mr state interacting with a polarized 
laser beam, e.g. sodium atoms in a magnetic trap that absorb a circularly- 
polarized probe beam (Fig. 7.5) 

This gives effectively a two-level system and the polarization of the light 
matches the atom’s orientation so eqn 7.76 applies (the pre-factor has 
the maximum value of 3). To drive the AMp = +1 transition the 
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Fig. 7.5 The Zeeman states of the 3s ?P,;2 F = 2 and 3p ?P3/2 F’ = 3 hyperfine 
levels of sodium, and the allowed electric dipole transitions between them. The other 
hyperfine levels (F' = 1 and F’ = 0,1 and 2) have not been shown. Excitation of the 
transition F = 2, Mp = 2 to F’ = 3, Mp = 3 (labelled a) gives a closed cycle that 
has similar properties to a two-level atom—the selection rules dictate that atoms in 
the F’ = 3, My = 3 state spontaneously decay back to the initial state. (Circularly- 
polarized light that excites AMp = +1 transitions leads to cycles of absorption and 
emission that tend to drive the population in the F = 2 level towards the state of 
maximum Mp, and this optical pumping process provides a way of preparing a sample 
of atoms in this state.) When all the atoms have the correct orientation, i.e. they 
are in the F = 2, My = 2 state for this example, then eqn 7.76 applies. Atoms in 
this state give less absorption for linearly-polarized light (transition b), or circular 
polarization of the wrong handedness (transition c). 
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circularly-polarized light must have the correct handedness and propa- 
gate along the atom’s quantisation axis (defined by the magnetic field 
in this example).*” 


Example 7.3 The absorption of light on an s-p transition, e.g. the 
3s—3p resonance line of sodium 

Light with a particular polarization and direction drives a transition to 
one magnetic sub-level in the upper level, as shown in Fig. 7.6(a). Since 
the lower level has only m; = 0 there is no averaging over the orientation 
and eqn 7.76 applies. Unpolarized light drives transitions to the three 
upper m; states equally. For each transition the averaging gives a factor 
of 1/3 but all three transitions contribute equally to the absorption so 
the atoms have the same cross-section as for polarized light (eqn 7.79 
with g/g; = 3). Thus the s—p transition is a special case that gives 
the same absorption cross-section whatever the polarization of the light. 
Atoms with m,; = 0 have no preferred direction and interact in the same 
way with light of any polarization (or direction). In contrast, for the p—s 
transition shown in Fig. 7.6(b), atoms in a given m; state only interact 
with light that has the correct polarization to drive the transition to 
my, = 0. 


7.6.1 Cross-section for pure radiative broadening 


The peak absorption cross-section given by eqn 7.76, when w = wp, is 
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Fig. 7.6 A comparison of s—p and p-s transitions. (a) The three transitions a, b 
and c between the s and p levels have equal strength. The physical reason for this is 
that the spontaneous decay rate of the upper m,; states cannot depend on the atom’s 
orientation in space. Light linearly-polarized parallel to the z-axis drives 7-transition 
b only, and spontaneous decay occurs back to the initial state since there are no other 
accessible states—this gives the equivalent of a two-level system. The s—p transition 
is a special case where the absorption does not depend on the polarization, e.g. 
unpolarized light gives equal excitation rates on the three transitions a, b and c, and 
this increases the absorption by the degeneracy factor g2/gi1 = 3, thereby cancelling 
the 1/3 that arises in the orientational average. (b) In contrast, for the p-s transition 
the peak cross-section is one-ninth of that in (a).*% 


47 The direction of the electric field at 
the atom depends on both the polar- 
ization and direction of the radiation, 
e.g. circularly-polarized light that prop- 
agates perpendicular to the quantisa- 
tion axis drives AMp = 0 and +1 
(z- and o-) transitions. This leads to 
a smaller cross-section than when the 
light propagates along the axis. Radi- 
ation that propagates in all directions 
does not produce a polarized electric 
field, e.g. isotropic radiation in a black- 
body enclosure. 


48 spin is ignored here. ‘This applies 
when either the fine structure is not re- 
solved, e.g. this may arise for the tran- 
sition 2s-3p in hydrogen where the fine 
structure of the upper level is small, 
or to transitions between singlet terms, 
i.e. 1S-1P and 'P-'S (with m > M, 
in the figure). 
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49- This equation is very similar to the 
formula for the saturation of gain in a 
homogeneously-broadened laser system 
since gain is negative absorption. 


50Here Isat = hwAoi/(20(wo)) and 
a(wo) is given by eqn 7.81. 


In a two-level atom the upper level can only decay to level 1 so T = Ao, 
and for a transition of wavelength Ay = 27c/wo we find 


2 
o(wo) =3x Sr 2. (7.81) 


This maximum cross-section is much larger than the size of the atom, 
e.g. the Ay = 589nm transition of sodium has o (wo) = 2 x 107-3 m’, 
whereas in kinetic theory the atoms have a cross-section of only md? = 
3x 10718 m~? for an atomic diameter of d = 0.3nm—‘collisions’ between 
atoms and photons have a large resonant enhancement. The optical 
cross-section decreases rapidly off resonance, e.g. light of wavelength 
600nm gives T/ (w—wo) = 10~® for the sodium transition above, so 
that o (w) = 10-1? x o (wo) = 2 x 107-7° m?. Clearly the absorption of 
radiation has little relation to the size of the electronic orbitals. 


7.6.2 The saturation intensity 


In the previous section we calculated the absorption cross-section start- 
ing from eqn 7.73 and we shall now use the same equation to determine 
the population difference; we can write eqn 7.73 as (N; — No) xr = No, 
where the dimensionless ratio r = o (w) I (w) /(hwAg,). This equation 
and N, + No = N give the difference in population densities as 


N N 


N, —- No = —— = —— > : 
i? ee Te ae 
where the saturation intensity is defined by 
hw Ao, 
ds = ; ; 


It is important to note that other definitions of saturation intensity are 
also used, such as the above expression without 2 in the denominator. 
From eqn 7.72 we find that the absorption coefficient depends on inten- 


sity as follows:*9 


Na (w) 
14+ I/I,(w) 
The minimum value of Isat (w) occurs on resonance where the cross- 
section is largest; this minimum value is often referred to as the satura- 
tion intensity, Isa, = Ig (wo), given by”? 


Kk (w,I) = (7.84) 


Tt he 


Tat = g 37? 


(7.85) 
where 7 = I’‘~* is the lifetime for radiative broadening. For example, the 
resonance transition in sodium at \ = 589 nm has a lifetime of rT = 16 ns 
and for an appropriate polarization (as in Fig. 7.5) the atom cycles on 
an effectively two-level transition. This leads to an intensity Isa, = 
60 W m~*, or 6mWcm ~®, that can easily be produced by a tunable dye 
laser. 
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We can also obtain eqn 7.82 directly from the steady-state value of 


w = (N, — N2)/N in eqn 7.68 if the saturation intensity is defined by 


I _ 20? 
ee (7.86) 


This is equivalent to eqn 7.85.°! At saturation the Rabi frequency has a 
value comparable with I. 


7.6.3 Power broadening 


Equation 7.84 for «(w,Z) contains two quantities that vary with fre- 
quency: o(w) and I,(w).°? Rearranging this equation to show the fre- 
quency dependence, and defining 09 = o(wo) as the maximum cross- 
section in eqn 7.81, we find that 


2 
& (w,1) = Nop > _—__ I 12/4 
By) ey ea 
T?/4 
Bias / (7.87) 


(w — wo)” + 412 (141 /Tsat) 


The expression for the absorption coefficient « (w,/) has a Lorentzian 
line shape with a full width at half maximum (FWHM) of 


1 \v2 
~] , 


The line width increases with intensity. This power broadening occurs 
because saturation reduces the absorption near the resonance while far 
from resonance the absorption changes little (see Fig. 7.7). The expres- 
sion for the population in the upper level 22 in eqn 7.69 also has the 
same power-broadened line width, as in eqn 7.88.°° The relationship be- 
tween this absorption and the populations of the two levels is discussed 
in Exercise 7.11. 


Awpwum =T (1 SF (7.88) 


Absorption 
coefficient K(w, I) 


®l as discussed after eqn 7.75, 92/I 
does not depend on the electric field. 


52mm terms of the minimum value 
Tsat = Is (wo) we can rewrite eqn 7.83 
as 
Ts (w) _ 90 
ow)” 


Tsat 
We also have 


es 


eae a 


53- This can be shown by rearrangement 
of the denominator of eqn 7.69 in terms 
of a line width 


ae 


Fig. 7.7 The absorption coefficient 
k(w,J) is a Lorentzian function of the 
frequency that peaks at wo, the atomic 
resonance. Saturation causes the ab- 
sorption line shape to change from the 
curve for a low intensity I < Isat, 
dashed line), to a broader curve (solid 
line), with a lower peak value, described 
by the Lorentzian function in eqn 7.87. 
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Fig. 7.8 The treatment of the inter- 
action of a two-level atom with radi- 
ation by time-dependent perturbation 
theory leads to eqn 7.89, that looks sim- 
ilar to a time-independent perturbation 
(proportional to 2) of two energy lev- 
els with an energy separation of 6. The 
light shift is the difference between the 
unperturbed energies and energy eigen- 
values of the system when it is per- 
turbed by the radiation. 


7.7 The a.c. Stark effect or light shift 


In addition to its effect on the populations, the perturbing radiation also 
changes the energy of the levels and we calculate this light shift in this 
section. We can write eqns 7.41 for ¢, and C2 in matrix form as 


a (&) = (Ge “da) &) 789 


This has solutions of the form 


ey = Gi eit (7.90) 


The equation for the eigenvalues A is 
2 2 


0/2 —d/2—A 

Hence A = + (62 +?)''"/2. For 2 = 0 the unperturbed eigenvalues 
are \ = +6/2, corresponding to two levels 6 apart, as shown in Fig. 7.8. 
This result of time-dependent perturbation theory closely resembles the 
equations for a time-independent perturbation of two states with an 
energy separation of 6 (see Appendix A); the two states are the excited 
state at E. and a level at energy E, + hw, corresponding to the ground 
state plus a photon of the radiation field, see Fig. 7.9. This system of 
atom plus photon is called a ‘dressed atom’—see Cohen-Tannoudji et al. 
(1992) for a much more in-depth treatment. 

Normally light shifts are most important at large frequency detuning 
where the effect of absorption is negligible; in this case |6| > © and the 


eigenvalues are 
Ae (5 + ye ) 7.92 
7 DAO a ae 
The states are shifted from their unperturbed eigenfrequencies by the 
light shift +0?/46. From eqn 7.89 we see that the amplitude ¢ is 
associated with the state with unperturbed energy +6/2 and lies above 
the other state when 6 > 0. This state with amplitude ¢, has a light 
shift of 


1/2 


w 


Ab . 


Aw; a 


(7.93) 


Unperturbed- 


Perturbed 


7.8 


Wy 


2) 9) 
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(c) 
E/h 
|2) 
W9 
Wy 
|1) 
——S—SSD 


V 


Fig. 7.9 Eigenenergies of a two-level atom interacting with an external electric field. (a) and (b) show the a.c. Stark effect 


for negative and positive frequency detunings respectively, as a function of the Rab 
function of the applied field strength. 


This equation is valid for negative frequency detuning 6 < 0 when the 
light shift of this state decreases its energy. The dependence of the light 
shift on the sign of 6 has important consequences for dipole-force traps 
for atoms, as described in Chapter 9. Figure 7.9 summarises the light 
shift and shows the d.c. Stark shift for comparison, and the eigenstates 
of the perturbation are discussed in Appendix A. 


7.8 Comment on semiclassical theory 


This chapter’s treatment of the interaction of radiation with atoms is 
semiclassical—the energy of the atoms is quantised but the radiation is 
not (since Ep cos wt is a classical electric field). It is individual two-level 
atoms that absorb energy in lumps of hw from the radiation; neverthe- 
less, the quantity I/fw is commonly referred to as the flux of photons. 
In addition to transitions between two bound quantum states of the 
atom, this semiclassical theory can also describe photo-ionization where 
light excites an electron from a bound state to an unbound state above 
the ionization limit. In such a transition the atom suddenly becomes an 
ion (plus a free electron) at a given time, like a quantum jump between 
bound states. The average of many such jumps corresponds to the rate 
predicted semiclassically.°* Photo-ionization of individual atoms closely 
resembles the photoelectric effect that occurs at the surface of a metal 
with work function ©® illuminated by light of frequency w. The sur- 


i frequency. (c) The d.c. Stark effect as a 


54cimilarly, in radioactivity the indi- 
vidual nuclei decay randomly but a 
large sample exhibits a smooth expo- 
nential decay. 
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55 The prompt emission of electrons af- 
ter the light hits the surface can also be 
explained semiclassically. 


56in the radio-frequency region the 
damping time can easily be longer than 
the time of measurement (even if it lasts 
several seconds). For allowed optical 
transitions the coherent evolution lasts 
for only a few nanoseconds, but it can 
be observed using short pulses of laser 
radiation. Optical experiments have 
also been carried out with exception- 
ally long-lived two-photon transitions 
(Demtréder 1996). 


face only emits electrons if iw > ®. Semiclassical theory explains this 
observation if the least tightly bound energy level, or energy band, of 
electrons in the metal has a binding energy of ® (cf. the ionization energy 
of atoms). Electrons come off the surface with a maximum kinetic en- 
ergy of hw — ® because the oscillating field resonantly drives transitions 
that have angular frequencies close to w (see eqn 7.15), from a lower 
level in the metal to an upper unbound ‘level’; this explanation does not 
require quantisation of the light into photons, as often implied in ele- 
mentary quantum physics. A common line of argument is that a purely 
classical theory cannot explain various aspects of the photoelectric effect 
and therefore the light must be quantised. The above discussion shows 
that this effect can be explained by quantisation of the atoms.°° 

However, there are phenomena that do require quantisation of the 
radiation field, e.g. in the fluorescence from a single trapped ion it is 
found that two photons have a lower probability of arriving at the de- 
tector together (within the short time period of a measurement) than 
predicted for a totally random source of photons. This spreading out of 
the photons, or anti-bunching, occurs because it takes a time to excite 
the ion again after spontaneous emission. Such correlation of photons, 
or anticorrelation in this case, goes beyond the semiclassical theory pre- 
sented here. Quantitative calculations of photon statistics require a 
fully quantum theory called quantum optics. The book on this subject 
by Loudon (2000) contains more fascinating examples, and also gives 
rigorous derivations of many results used in this chapter. 


7.9 Conclusions 


The treatment of the interaction of atoms with radiation that has been 
described in this chapter forms the foundation of spectroscopy, e.g. laser 
spectroscopy as described in Chapter 8 (and laser physics); it is also 
important in quantum optics. There are a variety of approaches to this 
subject and it is worthwhile summarising the particular route that has 
been followed here. 

The introductory sections closely follow Loudon (2000), and Section 7.3 
gave a terse account of the coherent evolution of a two-level system 
interacting with single-frequency radiation—the atom undergoes Rabi 
oscillations and we saw that the Bloch sphere gives a useful way of 
thinking about the effect of sequences of a- and 7/2-pulses of the atom. 
In Section 7.5 the introduction of damping terms in the equations was 
justified by analogy with a behaviour of a classical dipole oscillator and 
this led to the optical Bloch equations. These equations give a complete 
description of the system and show how the behaviour at short times 
where damping has a negligible effect®® is connected to what happens at 
longer times (much greater than the damping time) when a steady state 
has been established. We found that for radiative damping the system 
settles down to a steady-state solution described by a set of rate equa- 
tions for the populations of the levels—this turns out to be a general 


feature for all broadening mechanisms.°” In particular, this means that 
the steady-state populations for an atom illuminated by monochromatic 
(laser) radiation can be related to the case of illumination by broad- 
band radiation and the rate equations that Einstein wrote down. The 
theory of the interaction between radiation and matter can also be de- 
veloped by working ‘backwards’ from the rate equations in Einstein’s 
treatment of broadband radiation to find the rate equations for the case 
of monochromatic radiation by making ‘reasonable’ assumptions about 
the atomic line shape, etc. This approach avoids perturbation theory 
and is commonly adopted in laser physics; however, it gives no informa- 
tion about coherent phenomena (Rabi oscillations, etc.).°° 

Section 7.6 introduced the concept of an absorption cross-section and 
its use in the calculation of the absorption of radiation propagating 
through a gas with a certain number density of atoms.°? The discussion 
of the cross-section provides a link between two different perspectives 
on the interaction of radiation with matter, namely (a) the effect of 
the radiation on the individual atoms, and (b) the effect of the atomic 
gas (medium) on the radiation, e.g. absorption.®° The saturation of ab- 
sorption as characterised by the saturation intensity forms the basis for 
a method of Doppler-free spectroscopy described in Chapter 8. From 
viewpoint (a), saturation arises because there is a maximum rate at 
which an atom can scatter radiation and this result for individual atoms 
is important in the discussion of radiation forces in Chapter 9. The for- 
malism developed in this chapter allowed a straightforward derivation of 
the a.c. Stark effect on the atomic energy levels; this light shift is used in 
some of the methods of trapping and cooling atoms with laser radiation 
(also described in Chapter 9). 


Further reading 


Loudon’s book on quantum optics gives more rigorous derivations of 
many formulae in this chapter.°! Further properties of the optical Bloch 
equations are discussed by Cohen-Tannoudji et al. (1992) and Barnett 
and Radmore (1997). The treatment of the optical absorption cross- 
section of a gas closely resembles the calculation of the gain cross-section 
for a laser and further details can be found in books on laser physics in- 
cluding detailed discussion of broadening mechanisms, e.g. Davis (1996) 
and Corney (2000). 


7.9 Conclusions 147 


57 Both for mechanisms that are homo- 
geneous like radiative broadening, e.g. 
collisions as shown in Loudon (2000), 
and also for inhomogeneous mecha- 
nisms such as Doppler broadening (Ex- 
Sie (2), 


58 Using time-dependent perturbation 
theory to describe the underlying be- 
haviour of the two-level atom and find- 
ing the rate equations from them gives 
a clear understanding of the conditions 
under which these rate equations for 
the populations are valid. 


59 The inverse of absorption is gain; this 
is a critical parameter in laser systems 
that is calculated in terms of an opti- 
cal cross-section in a very similar way 
to absorption. The gain also exhibits 
saturation. 


eb Rn important objective of this chap- 
ter was to show that both of these view- 
points embody the same physics. 


617 have used similar notation, except 
T for the full width at half maximum 
(FWHM), whereas Loudon uses the 
half width y = 1/2. 
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ie 


Exercises 


(7.1) 


(7.2) 


(7.3) 


Averaging over spatial orientations of the atom 


(a) Light linearly polarized along the z-axis 
gives a dipole matrix element of Xi2 = 
(2| r|1) cos @ sin 6. Show that the average over 
all solid angles gives a factor of 1/3, as in 
eqn 7.21. 


Either show explicitly that the same factor of 
1/3 arises for light linearly polarized along the 
z-axis, E = Epé,coswt, or prove this by a 
general argument. 


Rabi oscillations 


(a) Prove that eqns 7.25 lead to eqn 7.26 and that 
this second-order differential equation has a 
solution consistent with eqn 7.27. 

(b) Plot |co (t)|? for the cases of w — wo = 0, 2 
and 32. 


m- and n/2-pulses 


(a) For zero detuning, w = wo, and the initial con- 
ditions c; (0) = 1 and c2 (0) = 0, solve eqns 
7.25 to find both c; (¢) and cp (t). 


Prove that a m-pulse gives the operation in 
eqn 7.30. 


(c) What is the overall effect of two 7-pulses act- 
ing on |1)? 
(d) Show that a 7/2-pulse gives |1) —- 


{|1) — i]2)} /V2. 

What is the overali effect of two 2/2-pulses 
acting on |1)? When state |2) experiences a 
phase shift of ¢, between the two pulses, show 
that the probabilities of ending up in states 
\1) and |2) are sin?(¢/2) and cos”(¢/2), re- 
spectively. 

(f) Calculate the effect of the three-pulse se- 
quence 1/2-x-7/2, with a phase shift of ¢ 
between the second and third pulses. (The 
operators can be written as 2 x 2 unitary ma- 
trices, although this is not really necessary for 
this simple case.) 


Comment. Without the factors of —i the signals 
in the two output ports of the interferometer are 
not complementary. The fact that the identity op- 
eration is a 4z-pulse rather than 27 stems from 


(7.4) 


(7.5) 


the isomorphism between the two-level atom and 
a spin-1/2 system. 

The steady-state excitation rate with radiative 
broadening 

An alternative treatment of radiative decay simply 
introduces decay terms into eqn 7.25 to give 


(7.94) 


:¢ Q i(w—wo)t 410 
= = I—C2, 
ey == 5° SF 5 2 


—i(w—wo)t 


as Q a 

ie = ci ze ~i5e2. (7.95) 
This is a phenomenological model, i.e. a guess that 
works. The integrating factor exp (Tt/2) allows 


eqn 7.95 to be written as 


d Tt 
atorr(>)} 
= ier F exp { =i € — Wo + >) t} ’ 
2) D 


(a) Show that for 2 = 0 eqn 7.95 predicts that 
Ico (t)|? = len (t = 0)[?e™™. 


(b) For the initial conditions c, (0) = 1 and 
co (0) = 0, integration of eqn 7.94 gives c) ~ 
1. For these conditions and weak excitation 
(Q <T) show that, after a time which is long 
compared to the radiative lifetime, level 2 has 
a steady-state population given by 


2 Q? /4 

|= ae 

(w—wo)° + T?/4 

Saturation of absorption 

The 3s—3p resonance line of sodium has, ['/27 = 
10 MHz and a wavelength of A = 589 nm. 


(a) Sodium atoms in a magnetic trap form a 
spherical cloud of diameter 1mm. The 
Doppler shift and the Zeeman effect of the 
field are both small compared to [. Calculate 
the number of atoms that gives an absorption 
of e~' = 0.37 for a weak resonant laser beam. 


(b) 


Determine the absorption of a beam with in- 
LeNnsitiviel — vente 


(7.6) The properties of some transitions in hydrogen 


The table gives the values of Aoi for transitions 
from the n = 3 shell of hydrogen to lower levels. 
(Spin and fine structure are ignored.) 


Transition Aoi (s~*) 
2p-3s 6.3 x 10° 
1s-3p eT oo" 
2s-3p PO 1) 
2p-3d 6.5 x 10” 


(a) Draw an energy-level diagram for the n = 1,2 
and 3 shells in hydrogen that shows the al- 
lowed electric dipole transitions between the 
orbital angular momentum levels. (Neglect 
transitions for which n does not change, e.g. 
2s-2p.) 

Calculate the lifetimes of the 3s, 3p and 3d 
configurations. What fraction of atoms that 
start in 3p end up in the 2s configuration? 


(b) 


(c) An electron in the 2p configuration has a life- 
time of only 1.6ns. Why is this shorter than 
for the 3p configuration? 


Calculate the radial matrix elements D2 in 
units of ao for the transitions in the table, and 
for 1s—2p. 


(d) 


(e) Calculate Isat for these transitions. 


(7.7) The classical model of atomic absorption 


(a) A simple classical model of absorption as- 
sumes that an electron (in an atom) behaves 
like a damped simple harmonic oscillator of 
charge —e and mass me driven by the oscil- 
lating electric field of the radiation: Eo cos wt. 
The electron’s equation of motion has the form 
of eqn 7.55 with a driving force of constant 
amplitude Fp = —eEo. Find a solution of this 
equation in the form « = Ucoswt — Vsinwt 
(U and Y are not functions of time in the case 
considered here). 


Show that the displacement of the electron has 
an amplitude 


JU? +? = 


Fo/m 
(w? — w)? + (Bw)? 


25 —-1/2 
~ Fe fw uo? F} 


(7.8) 
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Show that the angular frequency w at which 
this amplitude is maximum is very close to wo 
for a narrow resonance. 


(c) Show that the phase is given by 


Y Bu 
tang = Ge 

How does this phase vary as the angular fre- 

quency w increases from w < wo to w > wo? 
(d) For frequencies close to the atomic resonance 
(w ~ wo), show that your expressions for U/ 
and Y can be written in an approximate form 
that agrees with eqns 7.58 and 7.59 (that were 
derived using the slowly-varying envelope ap- 
proximation in which the amplitude of the 
driving force may change slowly over time). 


(e) Show that in the steady state the power P ab- 
sorbed by the electron is a Lorentzian function 


f w: 
of w i 


12 tee i. Loree 
(w — wo)” + (8/2) 


Oscillator strength 

This question shows the usefulness of a dimen- 
sionless parameter called the absorption oscillator 
strength, denoted by fie. 


(a) Show that for the cross-section in eqn 7.79 we 
have 


i o (w) dw = 2? roc fis , (7.96) 


=O" 


where ro = 2.8 x 10° m and fiz = 


Imew D7, / (3h). 


From the simple model of the atom as an os- 
cillating electron in Exercise 7.7, find the clas- 
sical absorption cross-section <1 (w) in terms 
of 6, wo and fundamental constants. 


(c) Without the driving electric field, the oscilla- 
tor undergoes damped harmonic motion © = 
zoe °* cos (w/t — y). The power radiated by 
an oscillating dipole is given by eqn 1.22 (from 
classical electromagnetism (Corney 2000)). 
Determine £. 


(b) 


Show that oi (w) integrated over all frequen- 
cies gives 2m roc. 


(d) 


Comment. This classical value is the maxi- 
mum value for any transition, so fiz <1. The 
absorption oscillator strength is a fraction of 
the integrated cross-section associated with a 
given transition. 
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(7.9) 


(7.10) 


(Gar) 


(e) Calculate fi2 for the 3s-3p transition of 
sodium. (Data in Exercise 7.5.) 


The 3s—3p line of sodium has two components 
corresponding to each of the fine-structure lev- 
els 3p Pisa and 3p Pepe: What is the oscilla- 
tor strength associated with each component. 


(f) 


Calculate the absorption oscillator strength 
for the 1s-2p and 1s—3p transitions in hydro- 
gen using the data in Exercise 7.6. 


(g) 


Doppler broadening 

The Maxwell—Boltzmann distribution of the veloc- 
ities in a gas is a Gaussian function f (v), as de- 
fined in eqn 8.3. Explain why for excitation by 
monochromatic radiation of angular frequency w 
the population in the upper level is given by 


ee 
2 E(w)? 


. ia sin? {(w — wo + kv)t/2} 


(w — wo + kv)? 


lez (¢)|? = 


f (v) du. 


—co 


Assuming that the sinc” in the integrand acts like a 
Dirac delta function (as explained in Section 7.2), 
show that |c2|* is proportional to gp (w) in eqn 8.4. 
Comment. Doppler broadening washes out the 
Rabi oscillations because their frequency depends 
on the velocity, giving an equation similar to that 
for broadband radiation. For all broadening mech- 
anisms Rabi oscillations, and other coherent phe- 
nomena, are only seen on time-scales shorter than 
the reciprocal of the line width. 


An example of the use of Fourier transforms 
Show that an oscillator whose amplitude decays 
exponentially according to roe” ** cos (wt) radiates 
with a Lorentzian power spectrum. 


The balance between absorption and spontaneous 
eMISSLON 

Explain why absorption and the population in the 
upper level are related by 


K (w,I)I = No Aoi hw = N p22 Aoihw ; (7.97) 


(7.12) 


Show that this is consistent with eqns 7.87 and 
7.69 for K(w, I) and p22, respectively. 

Comment. An electric dipole does not radiate uni- 
formly in all directions but this does not matter 
here; only a tiny fraction of the spontaneous emis- 
sion goes along the direction of the incident beam. 
For example, in an experiment to measure the at- 
tenuation of a laser beam as it passes through a 
gas cell, a negligible fraction of the light scattered 
out of the beam falls on the photodetector that 
measures the power after the sample. 


The d.c. Stark effect 

This exercise goes through a treatment of the d.c. 
Stark effect for comparison with the a.c. Stark ef- 
fect. 


(a) An atom with two levels of energies Ey > Ei, 
and a separation of « = E2 — FE; is placed in 
a static electric field. Show that the Hamilto- 
nian for the system has the form 


- ie a 


where the matrix element for the perturbation 
V is proportional to the magnitude of the elec- 
tric field. Find the energy eigenvalues. The 
two levels move apart as shown in Fig. 7.9— 
this is a general feature of systems where a 
perturbation mixes the wavefunctions. 


H 


(b) Show that ‘weak’ fields produce a quadratic 
Stark effect on the atom, equivalent to the 
usual second-order perturbation-theory ex- 


pression for a perturbation Hy: 


esEciabSy ae 


NB = = 
; Eki 


A similar expression can be found for the en- 
ergy shift A> of the other level (in the oppo- 
site direction). 


Estimate the Stark shift for the ground state 
of a sodium atom in a field of 10° Vm~? (e.g. 
10’ V between plates 1 cm apart). 


(c) 


Doppler-free laser 
spectroscopy 


Doppler broadening is usually the dominant contribution to the observed 
width of lines in atomic spectra, at room temperature. The techniques 8.1 Doppler broadening of 


of Doppler-free laser ; spectroscopy overcome this limitation to give much spectral lines 151 
higher resolution than, for example, a Fabry—Perot étalon analyzing the 8.2 The crossed-beam 
light from a discharge lamp, as shown in Fig. 1.7(a). This chapter de- method 153 


scribes three examples that illustrate the principles of Doppler-free tech-__ 8-3 Saturated absorption 
niques: the crossed-beam method, saturated absorption spectroscopy SAREE HY! oe 
and two-photon spectroscopy. To use these high resolution techniques 


for precision measurements of atomic transition frequencies the laser fre- 
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8.5 Calibration in laser 


sapusiedlliie? oie a= ear spectrosco 168 
quency must b be « determined accurately. Thus the calibration is a crucial : oe 
part of laser spectroscopy + experiments, as ¢ discussed at the end of this Further reading 175 
chapter. Since it is important to understand ‘the problem. before looking Exercises 175 


at the solution, the chapter starts with an outline of Doppler broadening 
Oh SI 


8.1 Doppler broadening of spectral lines” 


The relationship between the angular frequency w of radiation in the 


laboratory frame of reference and the angular frequency. seen in a frame 
of reference moving at velocity v, as shown in Fig. 8.1, is 


w =w—kv, (8.1) 


where the wavevector of the radiation has magnitude k = w/c = 27/X. “If necessary, this and other equations 
asneanen in this chapter could be generalised by 


It is the component of the velocity along k that leads to the Doppler replacing kv. with the scalareproduct 
effect and here it has been assumed that k-v = kv. k-v. "i eer 


= 


Atom 
v 


Fig. 8.1 The Doppler effect on the observed frequency of radiation. Radiation that 
has an angular frequency of w in the laboratory frame of reference has the frequencies 
indicated in a reference frame moving with a speed v, e.g. the rest frame of an atom. 
Only the component of the velocity along the wavevector k contributes to the first- 
order Doppler shift. 
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Section 8.3 describes what happens 
when the atoms absorb a range of fre- 
quencies, given by the homogeneous 
width, in addition to any Doppler 
broadening. Absorption is considered 
here because of its relevance to laser 
spectroscopy, but Doppler broadening 
of an emission line arises in the same 
way—atoms emit at wo in their rest 
frame and we see a frequency shift in 
the laboratory. 


3This can easily be shown by differen- 
tiating the Maxwell speed distribution 
which is proportional to v? times the 
velocity distribution, see Table 8.1. 


4Note that aw) di 1 


5 The simple estimate of the FWHM as 
~ 2u/c that leads to eqn 6.38 turns out 
to be quite accurate. 


In this section, we consider the Doppler effect on the absorption by a 
gas where each atom absorbs radiation at frequency wp in its rest frame, 
i.e. when w! = wo.2 Thus atoms moving with velocity v absorb radiation 
when 6 = w — wo = kv, or equivalently 

Ee (8.2) 
Wo (e 


In a gas the fraction of atoms with velocity in the range v to v + dv is 


| M Mv? v" 
f(v) dv = eee exp (-7) dv = —= exp (-5) dv. (8.3) 


) OS av 


Here u = \/2kpT/M is the most probable speed for atoms of mass M 
at temperature 7’.? Relating vu to the frequency via eqn 8.2, we find that 
the absorption has the Gaussian line shape function* 


c ce? (w—wpo 
SS ame 8.4 
9D (w) Uw (mT exp u2 ( wo ) ( ) 


The maximum value occurs at w = wo and the function falls to half its 
maximum value at w — Wo = 61/2, where 


ao , =—in2 
UW oO ; 


The Doppler-broadened line has a full width at half maximum (FWHM) 
of Awp = 26; /2 given by® 


(8.5) 


A 
D = Vin? = ~1.7-. (8.6) 
Wo Cc c 
Kinetic theory gives the most probable speed in a gas as 
ik Pasi u: 
Sa =i ) Soares Doar aren 
u = 2230ms_~ x 300K * (8.7) 


Table 8.1 The characteristic velocities in a gas with a Maxwellian distribution of 


speeds and in an effusive atomic beam; u = \/2kp7'/M, where T is the temperature 
and M is the mass. The extra factor of v in the distribution for a beam, as compared 
to that of a gas, arises from the way that atoms effuse through a small hole of area 
A. Atoms with speed v are incident on a surface of area A at a rate of N(v) vA/4, 
where N(v) is the number density of atoms with speeds in the range v to v + du— 
faster atoms are more likely to pass through the hole. Integration over v leads to 
the well-known kinetic theory result NvA/4 for the flux that arrives at the surface, 
where N is the total number density. The mean speed 0 has a value between the 
most probable and the root-mean-square velocities. 


Gas Beam 
Distribution v exp(—v?/u?) v* exp (—v?/u’) 
Most probable v u J/3/2u 
Root-mean-square velocity 3/2 U JV2u 


In the formula, the atomic mass M must be expressed in atom mass 
units, e.g. M = 1a.m.u. for atomic hydrogen. Numerical values of u are 


given below for hydrogen and a vapour of caesium, both at a temperature 
or = 300K.° 


M(am.u.) u(ms7') Awp/wo Afp (GHz), for 600nm 
H 1 2230 1x 10= 6 
Cs 133 200 10r? 0.5 


The values given for the fractional width Awp/wo show that heavy 


elements have an order of magnitude smaller Doppler width than hy- 
drogen. The Doppler shift of the frequency Afp is also given for a 
pore nett ot O00 an. (This wavelength does not correspond to actual 
transitions.’) These calculations show that Doppler broadening limits 
optical spectroscopy to a resolution of ~ 10° even for heavy elements.® 
The Doppler effect on the absorption of a gas is an example of an 
inhomogeneous broadening mechanism; each atom interacts with the 
radiation in a different way because the frequency detuning, and hence 
absorption. and emission, depend on_ n the velocity of the individual atom. 
In contrast, the radiative broadening by spontaneous decay of the ex- 
cited lev level gives the same natural width for all atoms of the same species 


a 


in a gas—this is a _ homogeneous broadening mechanism.” The difference 


between homogeneous and inhomogeneous broadening i is crucially im- 
portant in laser physics and an extensive discussion and further examples 
_can be found in Davis (1996) and Corney (2000).' 


8.2 The crossed-beam method 


Figure 8.2 shows a simple way to reduce the Doppler effect on a transi- 
tion. The laser beam intersects the atomic beam at right angles. A thin 
vertical slit collimates the atomic beam to give a small angular spread 
a. This gives a spread in the component of the atomic velocity along 
the direction of the light of approximately avpeam. Atoms in the beam 
have slightly higher characteristic velocities than in a gas at the same 
temperature, as shown in Table 8.1, because faster atoms have a higher 
probability of effusing out of the oven. Collimation reduces the Doppler 


broadening to 
QVpeam 


r 


where Afp is the Doppler width of a gas at the same temperature as 
the beam." 


Af ~ aA fp , (8.8) 


Example 8.1 Calculation of the collimation angle for a beam of sodium 
that gives a residual Doppler broadening comparable with the natural 
width Afy = 10 MHz (for the resonance transition at A = 589 nm) 
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®In this table Afp = 1.7u/A. 


"The Doppler widths of optical transi- 
tions in other elements normally lie be- 
tween the values for H and Cs. A useful 
way to remember the correct order of 
magnitude is as follows. The speed of 
sound in air is 330ms~! (slightly less 
than the speed of the air molecules). 
The speed of sound divided by the 
speed of light equals 10~®. Multipli- 
cation by a factor of 2 converts the 
half-width to a FWHM of Awp/w, ~ 
2x 10~®, which gives a reasonable esti- 
mate of the fractional Doppler shift of 
medium-heavy elements. 


8The resolving power of a Fabry—Perot 
étalon can easily exceed ~ 10° (Brooker 


~ 2003), so that normally the instrumen-_ 


tal width does not limit the resolution, 
in the visible region. W-B. 


° The different characteristics of the two 
types of broadening mechanism are dis- 
cussed further in this chapter, e.g. see 
Fig. 8.3. 


10The treatment of the saturation of 
gain in different classes of laser system 
is closely related to the discussion of 
saturation of absorption, both in prin- 
ciple and also in the historical develop- 
ment of these subjects. 


11.4 numerical factor of 0.7 ~ ILA 
has been dropped. To obtain a precise 
formula we would have to consider the 
velocity distribution in the beam and 
its collimation—usually the exit slit of 
the oven and the collimation slit have 
comparable widths. 
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Fig. 8.2 Laser spectroscopy of a colli- 
mated atomic beam. The component 
of atomic velocity along the laser beam 
has a small spread avpeam, where a is 
the collimation angle shown on the plan 
view in (b). 


12From eqn 7.50, Af ~ 1/At. 


Detector 


Atomic 
beam 


Sodium vapour at 1000K has a Doppler width of Afp = 2.5GHz and 
the most probably velocity in the beam is Upeam © 1000ms~!. Thus a 
suitable collimation angle for a beam of sodium effusing from an oven 
at this temperature is 


Afn 10 3 
Se 5500 rad. (8.9) 
This angle corresponds to a slit 1mm wide positioned 0.25m from the 
oven. Collimation of the beam to a smaller angular spread would just 
throw away more of the atomic flux to give a weaker signal without 
reducing the observed line width. 

In this experiment, atoms interact with the light for a time At ~ 
d/Vpeam, Where d is the laser beam diameter. The finite interaction time 
leads to a spread in frequencies called transit-time broadening.!” For a 
laser beam of diameter 1mm we find 

Ubeam 1000 


Afy = "= Th-3 & IMHz. (8.10) 


Thus this broadening mechanism does not have a significant effect in 
this experiment, compared to the natural width of the optical transi- 
tion. (Transit time is an important consideration for the radio-frequency 
measurements with atomic beams described in Section 6.4.2.) Collision 
broadening has a negligible effect in the experiment shown in Fig 8.2 
because of the low density of atoms in the atomic beam and also in 
the background gas in the vacuum chamber. An atomic-beam appa- 
ratus must have a high vacuum since even a glancing collision with a 


8.3 Saturated absorption spectroscopy 155 


background gas molecule deflects an atom out of the highly-collimated 
beam. 


Example 8.2 Figure 6.12 in the chapter on hyperfine structure showed 
a spectrum of tin (Sn) obtained by the crossed-beam technique that illus- 
trated isotope and hyperfine structure. Comparison with the Doppler- 
broadened spectrum emitted by a cadmium lamp clearly showed the 
advantages of the crossed-beam technique.!® 


Experimenters used highly-monochromatic light sources to demon- 
strate the principle of the crossed-beam method before the advent of 
lasers, but the two other techniques described in this chapter rely on the 
high-intensity and narrow-frequency bandwidth of laser light. 


8.3 _Saturated absorption spectroscopy 


We derived the line shape for Doppler broadening in Section 8.1 on the 
assumption that_an atom at rest absorbs radiation exactly_at wo. In 
reality the atoms absorb radiation over a range of frequencies given by 


the homogeneous width of the transition, e.g. the line width [caused _by_ 


radiative proad ening. In this section we eaell reconsider absorption of 
ing eee the ir ir nhomogeneous. nee TT by. the > atom’s 
motion. This approach leads naturally into a discussion_of saturated 
absorption spectroscopy. 

We consider a laser beam_of intensity J (w) that travels through a 
sample of of atoms, as shown in in Fig. 7.4. In this chapter we consider ihe 
atoms as moving, whereas previously t. they. were taken to be e stationary.'4 
Atoms witl with velocities in ‘the velocity class v tov + dv see radiation with 
an effective frequency ¢ of w — kv in their rest fre frame, and for those atoms 
the absorption cross-section is (Ww — kv), defined in eqn 7.76. The 
number density of atoms in this velocity class is N(v) = Nf(v), where 
N is the total number density of the gas (in units of atoms m~*) and 
the distribution f(v) is given in eqn 8.3. Integration of the contributions 
from all the velocity classes gives the absorption coefficient as 


i v) o (w — kv) du 


-- is m px v) gu (w — kv) du (8.11) 
_ 9 me Ce [/ (2m) 
a pe An XN I) orp - bu) 2/4 


The integral is the convolution of the Lorentzian function gy (w — kv) 
and the Gaussian function f (v).1° Except at very low temperatures the 
homogenous width is much less than the Doppler broadening, [ < Awp, 
so that the Lorentzian is sharply peaked and acts like a delta function 


eerie spacings between the lines from 
different isotopes does not depend on 
the angle between the laser beam and 
the atomic beam, but for absolute mea- 
surements of transition frequencies the 
angle must be accurately set to 90°. 


14a room temperature, the Doppler 


width usually exceeds natural and 
other homogeneous broadening mech- 
anisms. Very cold atomic vapours in 
which the Doppler shifts are smaller 
than the natural width of allowed tran- 
sitions can be prepared by the laser 
cooling techniques described in Chap- 
ter 9. 


151 general, the convolution leads to 
a Voigt function that needs to be cal- 
culated numerically (Corney 2000 and 
Loudon 2000). 
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16This is a convolution of the solution 
for a stationary atom with the velocity 
distribution (cf. Exercise 7.9). 


17 The cross-section only has a signifi- 
cant value near wo, so taking the lower 
limit of the integration to be 0 (which is 
realistic) or —oo (which is easy to eval- 
uate) makes little difference. 


gu (w — kv) = 6 (w — wo — kv) that picks out atoms moving with velocity 


(8.12) 


Integration over v transforms f (v) into the Gaussian line shape function 
in eqn 8.4:16 


ee / f@oa Gee (8.13) 


Thus since k(w) = No(w) (from eqn 7.70) we find from eqn 8.11 that 
the cross-section for Doppler-broadened absorption is 


22 
oO (w) = FF we Agi 9D (w) 4 ; (8.14) 


Integration of gp (w) over frequency gives unity, as in eqn 7.78 for ho- 
mogeneous broadening. Thus both types of broadening have the same 
integrated cross-section, namely!’ 


/ o (w) dw = v2 La (8.15) 
0 mn 4 


The line broadening mechanisms spread this integrated cross-section out 
over a range of frequencies so that the peak absorption decreases as 
the frequency spread increases. The ratio of the peak cross-sections 
approximately equals the ratio of the line widths: 


[o(o)IDoppter _ 9p (Wo) _ aa FP (8.16) 


[7 (Wo) Homog gu (Wo) Awp 


The numerical factor Vz ln 2 = 1.5 arises in the comparison of a Gaus- 
sian to a Lorentzian. For the 3s—3p resonance line of sodium [/2x7 = 
10 MHz and at room temperature Awp/27a = 1600 MHz, so the ratio of 
the cross-sections in eqn 8.16 is ~ 1/100. The Doppler-broadened gas 
gives less absorption, for the same N, because only 1% of the atoms 
interact with the radiation at the line centre—these are the atoms in 
the velocity class with v = 0 and width Av ~ T'/k. For homogeneous 
broadening all atoms interact with the light in the same way, by defini- 
tion. 


-3.1 Principle of saturated absorption. 
spectroscopy 


SO ae i 


This method of laser spectroscopy exploits the saturation of absorption 
to give a Doppler- -free signal. At high | intensities the population differ- 


ence between two levels is reduced as atoms are excited to the upper 


level, and we account for this by modifying eqn 8.11 to read 


re es je {M1 (v) — No (v)} oabs (w — kv) dv. (8.17) 


8.3 


This is the same as the modification we made in going from eqn 7.70 
to 7.72 but applied to each velocity class within the distribution. Here 
Nj (v) and N2(v) are the number densities in levels 1 and 2, respec- 
tively, for atoms with velocities between v and v+ dv. At low intensities 


almost all the atoms stay in level 1, so N; (inne N(v) has the Gaussian 
distribution in eqn 8.3 and No ~ 0, as illustrated in Fig. 8.3(a). For 
all intensities, the integral of the number densities in each velocity class 
equals the total number density ‘in that level, i. ue 


macensceamnas 


ae N, (v) dv = Ny, (8.18) 
mee et A 

and similarly for No. The total number density N = Ni + No. = 

“Tn saturated absorption spectroscopy t the he quantity tity Ni(v) — = - No(v ee 


affected by interaction with a strong laser beam, as shown in Fig. 8:3(b) 
and F Fig. 8.4 shows a vs a typical experimental a | arrangement. t. The beam split- 
ter divides the power of the laser beam between a_\ weak p sak probe. anda 
stronger pump beam.'? Both these beams have the same frequency w 
and the two beams go in opposite directions through the sample cell 
containing the a atomic vapour. The pump beam interacts with atoms 


that have velocity v = (w — wolf k and excites many of them into the 
upper level, as shown in Fig. 8.3(b). This is referred to as hole burning. 


BBs viele 


The hole burnt into the lower- level population by a beam of intensity [ 


1 \1/2 
Nojioie = T (1 oe ) ’ 
Tat 


has_a width 
equal to the power-broadened homogeneous width in eqn 7.88. 


When tl the laser has a frequency far from resonance, |w — wo| > Awnhole, 


the pump a eric) probe | beams interact with different atoms so the pump 
beam does not_affect the probe beam, as illustrated on t the left- and 


(8.19) 


(a) (b) 
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18 This treatment of saturation is re- 
stricted to two-level atoms. Real sys- 


cult to treat since, under conditions _ 
with ‘signification sat saturation of the ab-_ 


sorption, the atoms are usually not 


uniformly distributed over the sub- 


levels (unless the light is unpolarized). 
Nevertheless, the expression Ni(v) — 
91 1.Na(v)/g2 is often used for the differ- 
ence in popu ation densities s in a given 
velocity ‘class. 


HO qe wwe have Torovea < daapana 
Tpump Z > Isat: 


Fig. 8.3 The saturation of absorption. 
(a) A weak beam does not significantly 
alter the number density of atoms in 
each level. The number density in the 
lower level Ni(v) has a Gaussian dis- 
tribution of velocities characteristic of 
Doppler broadening of width Awp/k. 
The upper level has a negligible popu- 
lation, No(v) ~ 0. (b) A high-intensity 
laser beam burns a deep hole—the pop- 
ulation difference J Ni(v) — No(v) tends 
to zero for the atoms ; that interact most_ 
strongly with the light {those with ve- 
locity v = (w— wo)/k). Note that Ni(v) 
does not tend to zero: strong pumping 
of _a two-level system never gives popu- 
lation inversion. This figure also shows 
clearly that_ ioe broadening is in- 


homogeneous so that_atoms interact in 
different_ways within the radiation. 
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(a) 


Sample Detector 


(b) 


Signal without 
pump beam 


Signal with 
pump beam 


Fig. 8.4 (a) A saturated absorption spectroscopy experiment. The beam splitter BS, e.g. a piece of glass, divides the laser 
power between a weak probe and a stronger pump beam. The figure shows a finite angle of intersection between the weak probe 
beam and the stronger pump beam in the sample; this arrangement makes it straightforward to detect the probe beam after 
the cell but it leaves some residual Doppler broadening. Therefore saturated absorption experiments often have the pump and 
probe beams exactly counter-propagating and use a partially-reflecting mirror at ML to transmit some of the probe beam to the 
detector (while still reflecting enough of the pump beam). (b) A plot of the probe intensity transmitted through the sample as 
a function of the laser frequency. With the pump beam blocked the experiment gives a simple Doppler-broadened absorption, 
but in the presence of the pump beam a narrow peak appears at the atomic resonance frequency. (c) The population densities 
of the two levels Ni (v) and N2(v) as a function of velocity for three different laser frequencies: below, equal to, and above the 
atomic resonance, showing the effect of the pump and probe beams. 
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right-hand sides of Fig. 8.4(c). Close to resonance, w ~ wo, both beams 


interact_with atomis in the velocity class with v= 0, and the hole burnt 
by the pump beam reduces the absorption of th the e probe beam. Thus 
saturation of the absorption, by the pump beam leads to a narrow peak 
in the intensity of the probe beam transmitted through the sample, as 
shown in Fig. 8.4(b). Normally, the » pump | beam has an intensity of about 
the saturation intensity Isat, so the saturated absorption peaks always 
have_a line width greater than the natural width. The velocity class of 
atoms that interact with the light has a velocity spread Av = Awnole/k. 

This section shows how saturation spectroscopy picks out a ‘signal from 


the atoms - in ‘the velocity class centred at v = 0 to give a signal at the 


atomic resonance frequency. . It is the homogeneous broadening of these 


stationary atoms that determines the widths of the peaks. Exercise 8.8 
goes through a detailed calculation of this width. Many experiments use 
this Doppler-free technique to give a stable reference, e.g. to set the laser 
frequency a few line widths below resonance in laser cooling experiments 
with the optical molasses technique (described in the next chapter).?° 


8.3.2 Cross-over resonances in saturation 
spectroscopy 


In a saturated absorption spectrum, peaks appear at frequencies midway 
between pairs of transitions that have energy levels in common (and a 
separation less than the Doppler width), e.g. for the three-level atom 
shown in Fig. 8.5(a). To explain these cross-over resonances we need to 
consider the situation shown in Fig. 8.5(b), where the pump beam burns 
two holes in the velocity distribution. These holes give rise to two peaks 
in the spectrum when the laser frequency corresponds to the frequencies 
of the two transitions—the ‘expected’ saturated absorption signals for 
these two transitions. However, an additional peak appears when the 
hole burnt by one transition reduces the absorption for the other transi- 
tion. As illustrated by Fig. 8.5(b), the symmetry of this situation means 
that cross-overs occur exactly midway between two saturated absorption 
peaks. This property allows experimenters to identify the cross-overs in 
a saturated absorption spectrum (see the exercises at the end of this 
chapter), and these extra peaks do not generally cause confusion. 

The spectral lines of atomic hydrogen have large Doppler widths be- 
cause it is the lightest element, but physicists want to measure the energy 
levels of this simple atom precisely to test atomic physics theory and to 
determine the Rydberg constant. 

Figure 8.6 shows a spectrum of the Balmer-a line (n = 2 ton = 3) that 
is limited by Doppler broadening. This red line of atomic hydrogen, at a 
wavelength of \ = 656nm, has a Doppler width of Afp = 6 GHz at room 
temperature (Section 8.1); this is less than the 11 GHz interval between 
the 7 = 1/2 and 3/2 fine-structure levels in the n = 2 shell. Using 
the isotope deuterium (which has twice the atomic mass of hydrogen) 
in a discharge cooled to 100K reduces the Doppler width to Afp = 
2.3 GHz,”! where one factor arises from the mass and the other from the 


20 Nowadays, inexpensive semicon- 
ductor diode lasers make saturation 
spectroscopy a feasible experiment in 
undergraduate teaching laboratories, 
using the alkali elements rubidium or 
caesium that have sufficient vapour 
pressure at room temperature that a 
simple glass cell can be used as the 
sample (Wieman et al. 1999). 


21 As calculated by scaling the value for 
hydrogen. The ratio of the Doppler 
widths for H at T = 300K and D at 
T= WOK is /6= v2 = v2. 
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a a 2 v= a 
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Fig. 8.5 The formation of a cross-over resonance. (a) A three-level atom with two allowed transitions at angular frequencies wi 
and wig. (b) A cross-over resonance occurs at X, midway between two saturated absorption peaks corresponding to transitions 
at, angular frequencies wi2 and wi3. At the cross-over the hole burnt by the pump beam acting on transition 1 + 2 reduces 
the probe beam absorption on transition 1 +> 3, and vice versa. 


22 The expectation value of the spin 
orbit interaction scales as 1/n? (in 
eqn 2.56) and hence the splitting for 
je ois 8/27 = 03 times that fox 
= 2h 


temperature. This makes it possible to observe components separated 
by the 3.3 GHz interval between the j = 1/2 and 3/2 fine-structure levels 
in the n = 3 shell—see Figs 8.6(c) and 8.7(a).?? Structure on the scale 
of the 1GHz corresponding to the Lamb shift cannot be resolved by 
conventional Doppler-limited techniques. 

Figure 8.7 shows the spectacular improvement in resolution obtained 
with Doppler-free spectroscopy. The saturated absorption spectrum 
shown in Fig. 8.7(c) was obtained in a room-temperature discharge of 
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atomic hydrogen and part of the spectrum in Fig. 8.6(b) is shown for 
comparison.”? The saturated absorption technique gives clearly resolved 
peaks from the 2a and 2b transitions with a separation equal to the Lamb 
shift—the QED contributions shift the energy of the 2s 7S, /2 level up- 
wards relative to 2p 7P, j2. Lamb and Retherford had measured this 
shift by a radio-frequency method using a metastable beam of hydrogen 


Fig. 8.6 Spectroscopy of the Balmer-a 
line, carried out with an apparatus sim- 
ilar to that in Fig. 1.7(a), has a resolu- 
tion limited by the Doppler effect. (a) 
The transmission peaks of a pressure- 
scanned Fabry-Perot étalon obtained 
with a highly-monochromatic source 
(helium—neon laser). The spacing of 
the peaks equals the free-spectral range 
of the étalon given by FSR = 1/2! = 
1.68cm—!, where / is the distance be- 
tween the two highly-reflecting mirrors. 
The ratio of the FSR to the width of 
the peaks (FWHM) equals the finesse 
of the étalon, which is about 40 in this 
case. (The difference in height of the 
two peaks in this trace of real data 
arises from changes in laser intensity 
over time.) In all the traces, (a) to (c), 
the étalon was scanned over two free- 
spectral ranges. (b) The spectrum from 
a discharge of hydrogen, H, and deu- 
terium, D, at room temperature. For 
each isotope, the two components have 
a separation approximately equal to the 
interval between the fine-structure lev- 
els with n = 2. This splitting is slightly 
larger than the Doppler width for hy- 
drogen. The isotope shift between the 
hydrogen and deuterium lines is about 
2.5 times larger than the free-spectral 
range, so that adjacent peaks for H 
and D come from different orders of 
the étalon. (The étalon length has 
been carefully chosen to avoid overlap 
whilst giving high resolution.) (c) The 
spectrum of hydrogen and deuterium 
cooled to around 100K by immersing 
the discharge tube in liquid nitrogen. 
(The relative intensities change with 
discharge conditions.) The fine struc- 
ture of the 3p configuration is not quite 
resolved, even for deuterium, but leads 
to observable shoulders on the left of 
each peak—the relevant energy levels 
are shown in Fig. 8.7. Courtesy of Dr 
John H. Sanders, Physics department, 
University of Oxford. 


22-The first saturated absorption spec- 
trum of hydrogen was obtained by Pro- 
fessor Theodor Hansch and co-workers 
at Stanford University (around 1972). 
In those pioneering experiments the 
width of the observed peaks was lim- 
ited by the bandwidth of the pulsed 
lasers used. Continuous-wave lasers 
have lower bandwidth. 


162 Doppler-free laser spectroscopy 


Fig. 8.7 Spectroscopy of the Balmer- 
a transition. (a) The levels with prin- 
cipal quantum numbers n = 2 and 
nm = 3 and the transitions between 
them. Relativistic quantum mechanics 
(the Dirac equation) predicts that en- 
ergies depend only on n and j, lead- 
ing to the five transitions labelled 1 
to 5 in order of decreasing strength 
(proportional to the square of the ma- 
trix element). In reality, some of these 
levels are not degenerate because of 
QED effects, e.g. the Lamb shift be- 
tween 2s 7S /2 and 2p ?Pj/2 that gives 
two components in transitions 2 and 
3. Thus there are seven optical transi- 
tions (that were listed in Section 2.3.5). 
(The allowed transition between the 
2s 71/2 and 2p Pz levels, and other 
radio-frequency transitions are not 
marked.) (b) The Doppler-broadened 
profile of the Balmer-a line in a 
room-temperature discharge containing 
atomic hydrogen shows only two clear 
components separated by about 10 GHz 
(slightly less than the fine-structure 
splitting of the 2p configuration—see 
the caption of Fig. 8.6). (c) The sat- 
urated absorption spectrum obtained 
with a continuous-wave laser. The 
Lamb shift between the 2a and 2b 
components is clearly resolved. The 
2s ?S1/2 level has a hyperfine split- 
ting of 178 MHz and this leads to the 
double-peaked profile of 2a, 3a and 
the cross-over resonance X midway be- 
tween them. (In addition to their rel- 
ative positions, further evidence that 
peak X is the cross-over resonance be- 
tween 2a and 3a comes from their simi- 
lar line shape, which strongly indicates 
that they share a common level; the 
weak transition 3b is obscured.) Tran- 
sition 4 is also seen on the far left and 
the cross-over resonance between 4 and 
1 is just visible as a small bump on the 
base of peak 1. The scale gives the laser 
frequency relative to an arbitrary point 
(transition 1). Data shown in (c) was 
obtained by Dr John R. Brandenberger 
and the author. 


281/2 
2P 1/27 


(b) 


Doppler-broadened 
profile of hydrogen 


5) 41 3a 3b 2a2b 
|} 


—+»> (¢— Lamb shift 


(c) 


Saturation 
spectrum 


—§ 0 5 10 15 
Relative laser frequency (GHz) 


(atoms in 2s 7S, /2 level) but it was not resolved by optical techniques 
before the invention of Doppler-free laser spectroscopy. 


8.4 ‘Two-photon spectroscopy 


Two-photon spectroscopy uses two counter-propagating laser beams, as 
shown in Fig. 8.8. This arrangement has a superficial similarity to sat- 
urated absorption spectroscopy experiments (Fig. 8.4) but these two 
Doppler-free techniques differ fundamentally in principle. In two-photon 
spectroscopy the simultaneous absorption of two photons drives the 
atomic transition. If the atom absorbs one photon from each of the 
counter-propagating beams then the Doppler shifts cancel in the rest 
frame of the atom (Fig. 8.9(a)): 


(8.20) 


Beam splitter 
sends light 
to calibration 


T Detector 


Fig. 8.8 A two-photon spectroscopy experiment. The lens focuses light from the 
tunable laser into the sample and a curved mirror reflects this beam back on itself 
to give two counter-propagating beams that overlap in the sample. For this exam- 
ple, the photons spontaneously emitted after a two-photon absorption have different 
wavelengths from the laser radiation and pass through a filter that blocks scattered 
laser light. Usually, only one of the wavelengths corresponding to the allowed tran- 
sitions at frequencies w3; or w;2 (in the cascade shown in Fig. 8.9(a)) reaches the 
detector (a photomultiplier or photodiode). The beam splitter picks off some laser 
light to allow measurement of its frequency by the methods discussed in Section 8.5. 


(a) 

Laboratory 

Atom 

frame: w (1-2) w (1+ 2) 
(b) 


8.4 Two-photon spectroscopy 163 


Fig. 8.9 (a) The atom has a compo- 
nent of velocity v along the axis of the 
laser beams (the light has frequency 
w). The atoms sees an equal and op- 
posite Doppler shift for each beam. 
So these shifts cancel out in the sum 
of the frequencies of the two counter- 
propagating photons absorbed by the 
atom (eqn 8.20). The sum of the fre- 
quencies does not depend on v so reso- 
nance occurs for all atoms when 2w = 
wi2. {b) A two-photon transition be- 
tween levels 1 and 2. The atom decays 
in two steps that each emit a single pho- 
ton with frequencies w;2 and w};. 
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Fig. 8.10 A stimulated Raman transi- 
tion between levels 1 and 2, via a virtual 
level. Level 2 is not resonantly excited 
in this coherent process. 


When twice the laser frequency w equals the atomic resonance frequency 
2w) = w 2 all the atoms can absorb two photons; whereas in saturation 
spectroscopy the Doppler-free signal comes only from those atoms with 
zero velocity. 

For the energy-level structure shown in Fig. 8.9(b) the atom decays 
in two steps that each emit a single photon (following the two-photon 
absorption). Some of these photons end up at the detector. A brief con- 
sideration of this cascade process illustrates the distinction between a 
two-photon process and two single-photon transitions. It would be pos- 
sible to excite atoms from 1 to 2 using two laser beams with frequencies 
Wry = Wy; and Wrz = wj2 resonant with the two electric dipole transi- 
tions, but this two-step excitation has a completely different nature to 
the direct two-photon transition. The transfer of population via the in- 
termediate level i occurs at the rate determined by the rates of the two 
individual steps, whereas the two-photon transition has a virtual inter- 
mediate level with no transitory population in 7. (Equation 8.20 shows 
that to get a Doppler-free signal the two counter-propagating beams 
must have the same frequency.) This distinction between single- and 
two-photon transitions shows up clearly in the theory of these processes 
(see Section E.2 of Appendix E) and it is worthwhile to summarise some 
of the results here. Time-dependent perturbation theory gives the rate 
of transitions to the upper level induced by an oscillating electric field 
Eo coswt. The calculation of the rate of two-photon transitions requires 
second-order time-dependent perturbation theory. Resonant enhance- 
ment of the second-order process occurs when 2w = wj2 but this still 
gives a rate which is small compared to an allowed single-photon tran- 
sition. Therefore, to see any second-order effects, the first-order terms 
must be far off resonance; the frequency detuning from the intermediate 
level w — w1; must remain large (of the same order of magnitude as w; 
itself, as drawn in the Fig. 8.9(b)). Two-photon absorption has many 
similarities with stimulated Raman scattering—a process of simultane- 
ous absorption and stimulated emission of two photons via a virtual 
intermediate level, as shown in Fig. 8.10 (see Appendix E). 

Finally, although the difference between two sequential electric dipole 
transitions (E1) and a two-photon transition has been strongly empha- 
sised above, these processes do link the same levels. So from the E1 
selection rules (Al = +1 between levels of opposite parity) we deduce 
the two-photon selection rules: Al = 0,+2 and no change of parity, e.g. 
s-s or s—d transitions. 

Two-photon spectroscopy was first demonstrated on the 3s—4d transi- 
tion of atomic sodium which has a line width dominated by the natural 
width of the upper level. The 1s—2s transition in atomic hydrogen has 
an extremely narrow two-photon resonance, and the line width observed 
in experiments arises from the various broadening mechanisms that we 
study in the next section. 


Example 8.3 Two-photon spectroscopy of the ls—2s transition in atomic 
hydrogen 
The 1s-2s two-photon transition in atomic hydrogen has an intrinsic 
natural width of only 1 Hz because the 2s configuration is metastable. 
An atom in the 2s energy level has a lifetime of 1/8s, in the absence of 
any external perturbations, since there are no p configurations of signifi- 
cantly lower energy (see Fig. 2.2).24 In contrast, the 2p configuration has 
a lifetime of only 1.6 ns because of the strong Lyman-a transition to the 
ground state (with a wavelength of 121.5nm in the vacuum ultraviolet). 
This huge difference in lifetimes of the levels in n = 2 gives an indica- 
tion of the relative strengths of single- and two-photon transitions. The 
1s—2s transition has an intrinsic quality factor of Q = 10°, calculated 
from the transition frequency 3cRox divided by its natural width. To 
excite this two-photon transition the experiments required ultraviolet 
radiation at wavelength \ = 243 nm.”° 

Figure 8.11 shows a Doppler-free spectrum of the 1s~—2s transition. A 
resolution of 1 part in 10!° has not yet been achieved because the various 
mechanisms listed below limit the experimental line width. 


(a) Transit time Two-photon absorption is a nonlinear process?® with 
a rate proportional to the square of the laser beam intensity, I? (see 
Appendix E). Thus to give a high signal experimenters focus the 
counter-propagating beams down to a small size in the sample, as 
indicated in Fig. 8.8. For a beam diameter of d = 0.5mm transit- 
time broadening gives a contribution to the line width of 
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(8.21) 


where w is a typical velocity for hydrogen atoms (see eqn 8.7). 


Collision broadening (also called pressure broadening) Collisions 
with other atoms, or molecules, in the gas perturb the atom (inter- 
acting with the radiation) and lead to a broadening and frequency 
shift of the observed spectral lines. This homogeneous broadening 
mechanism causes an increase in the line width that depends on the 
collision rate 1/Teo, where Teo is the average time between colli- 
sions. In a simple treatment, the homogeneous width of a transition 
whose natural width is [ becomes Awhomog = F+2/Teon = [+2No%, 
where a is the collision cross-section and 0 is the mean relative ve- 
locity, as described by Corney (2000)—see also Loudon (2000) or 
Brooker (2003). The number density of the perturbing species N is 
proportional to the pressure. For the 1s—2s transition frequency the 
pressure broadening was measured to be 30GHz/bar for hydrogen 
atoms in a gas that is mostly hydrogen molecules (Hz), and this 
gives a major contribution to the line width of the signal shown in 
Fig. 8.11 of about 8 MHz (at the frequency of the ultraviolet radia- 
tion near 243nm).?” Some further details are given in Exercise 8.7. 


(c) Laser bandwidth ‘The first two-photon experiments used pulsed 
lasers to give high intensities and the laser bandwidth limited the 
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24The microwave transition from the 
2s ?S1/2 level down to the 2p ?P1/9 
level has negligible spontaneous emis- 
sion. 


25 Such short-wavelength radiation can- 
not be produced directly by tunable 
dye lasers, but requires frequency dou- 
bling of laser light at 486 nm by second- 
harmonic generation in a nonlinear 
crystal (a process that converts two 
photons into one of higher energy). 
Thus the frequency of the laser light 
(at 486nm) is exactly one-quarter of 
the 1s—2s transition frequency (when 
both the factors of 2 for the frequency- 
doubling process and two-photon ab- 
sorption are taken into account); thus 
the laser light (at 486nm) has a fre- 
quency very close to that of the Balmer- 
B line (n = 2 to n = 4) because the 
energies are proportional to 1/n? in hy- 
drogen. 


6In contrast, single-photon scattering 
well below saturation is a linear process 
proportional to the intensity 7. Satu- 
rated absorption spectroscopy is a non- 
linear process. 


27 Collisions shift the 1s-2s_ transi- 
tion frequency by —9GHz/bar and 
this pressure shift is more troublesome 
for precision measurements than the 
broadening of the line (see Boshier et al. 
1989 and Mcintyre et al. 1989). 
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Fig. 8.11 (a) The hyperfine struc- 
ture of the 1s and 2s configurations 
of hydrogen (not to scale). The two- 
photon transitions obey the selection 
rule AF = 0. This allows the tran- 
sitions) Ha —06LO eH. ——Ohand B= 1 
to F’ = 1. (b) A two-photon spec- 
trum of the 1s—2s transition in atomic 
hydrogen. The recorded signal comes 
from photons emitted from the gas (fol- 
lowing the two-photon excitation) that 
are detected by a photomultiplier, as 
shown in Fig. 8.8. This signal arises in 
a slightly different way to that shown in 
Fig. 8.9(b): the 2s configuration in hy- 
drogen decays very slowly since it has 
no allowed transition to 1s but transfer 
from 2s to 2p occurs by collisions with 
atoms (or molecules) in the gas, and 
the 2p configuration decays rapidly by 
the emission of Lyman-a photons. The 
scale gives the (relative) frequency of 
the ultraviolet radiation used to excite 
the two-photon transition (Foot et al. 
1985). Copyright 1985 by the Ameri- 
can Physical Society. 
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resolution. A laser with a pulse of duration 7 = 10ns has a lower 
limit to its bandwidth of 


IL 
Afi > . ~ 100 MHz. (8.22) 


This Fourier transform limit assumes a perfectly shaped pulse and 
in practice pulsed lasers typically have a bandwidth an order of 
magnitude greater. Commercial continuous-wave dye lasers have 
bandwidths of 1MHz but researchers use sophisticated electronic 
servo-control systems to reduce this. The best ion trap experiments 


use a laser system that generates ultraviolet radiation with a band- 
width of only a few Hz to give a resolution approaching 1 in 10!°. 


(d) Second-order Doppler effect Two-photon spectroscopy eliminates 
the first-order Doppler effect but not the second-order term that 
corresponds to time dilation in special relativity, namely 


2 
ke = fo = 0.1MHz. (8.23) 


For hydrogen u/c = 7 x 10~® (see Section 8.1) and fo = 2.5x 10! Hz 
for the 1s -2s transition. 

Time dilation depends on the square of the atom’s velocity and 
reduces the frequency of the emitted light seen by an observer in 
the laboratory, whichever direction the atom moves. This shifts the 
centre of the observed atomic line by an amount that depends on the 
velocity distribution of the atoms, and therefore causes uncertainty 
in precision measurements; it is worse than mechanisms that just 
broaden the line shape symmetrically about the atomic resonance 
frequency. 


(e) Light shift The light shift, or a.c. Stark effect (Section 7.7), affects 
two-photon spectroscopy experiments because of the high intensities 
required to give reasonable transition rates. The shift of the centre 
of the observed line shape causes problems in precision experiments 
for the same reason as the second-order Doppler effect.?° 


Detailed calculations of all systematic effects on the 1s—2s transition 
frequency are given in Boshier et al. (1989) and McIntyre et al. (1989). 
Table 8.2 is a check-list of effects that may broaden the peaks in Doppler- 
free spectroscopy (and in some cases cause frequency shifts). 

In his original experiment, Lamb, and his student Retherford, mea- 
sured the shift between 2s 7S,/2 and 2p *P1/2 directly with radio- 
frequency spectroscopy but the line width in their experiment was large 


Table 8.2 Summary of broadening mechanisms in Doppler-free spectroscopy. 


(i) Natural broadening. 
(ii) | Collisions (pressure broadening). 
(iii) Finite interaction time (transit-time broadening). 
(iv) Second-order Doppler effect. 
(v) Instrumental width—laser bandwidth. 
(vi) External fields—Zeeman and Stark effects. 
(vii) Residual Doppler broadening—if the beams are not exactly 
counter-propagating. 
(viii) Power broadening—related to saturation of the transition (in 
saturation spectroscopy). 
(ix) A.c. Stark effect—shift caused by the electric field of the light 
in two-photon spectroscopy. 
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28 The light shift does not significantly 
affect 1s—2s experiments because only 
low-power ultraviolet beams are gener- 
ated by nonlinear mixing. 
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because of the rapid decay of the 2p level (as mentioned at the beginning 
of this example). The 2p level has a natural width of 100 MHz—much 
larger than the line width in two-photon experiments. It is quite remark- 
able that the laser measurements of a transition frequency in the ultravi- 
olet can exceed the precision of radio-frequency spectroscopy. Although 
the QED shifts represent only a very small part of the 1s—2s transition 
energy, the laser experiments determine these shifts accurately if the 
experimenters know the frequency of the laser. The following section 
describes methods used to measure the laser frequency and so calibrate 
the spectra. 


8.5 Calibration in laser spectroscopy 


Laser spectroscopy experiments use tunable lasers, i.e. laser systems 
whose frequency can be tuned over a wide range to find the atomic, 
or molecular, resonances. The early experiments used dye lasers in the 
visible region, e.g. the dye Rhodamine 6G gives the yellow light for ex- 
periments with sodium. The best dyes have a tunable range of over 
50nm and modern dyes exist that operate from the deep blue into the 
infra-red. However, the use of dyes in solution can be messy, and nowa- 
days many experimenters prefer to use solid-state lasers that operate 
in the infra-red (Davis 1996); semiconductor diode lasers have a tun- 
ing range of about 10nm, and the more general-purpose titanium-doped 
sapphire laser operates anywhere in the range 700—1000nm. In compar- 
ison, the He—Ne laser only works within the Doppler profile of the neon 
transition; this has a Doppler width ~ 1GHz at 633nm corresponding 
to a wavelength range of only 0.001nm. This fixed and well-defined 
wavelength can be used as a frequency reference (and similarly for other 
lasers operating on atomic transitions). 

The method of calibrating the laser frequency depends on whether the 
experiment requires absolute or relative measurements. 


8.5.1 Calibration of the relative frequency 


Experiments that measure the separations of the components within the 
spectrum require an accurate frequency scale for the laser scan, e.g. in 
the measurement of the isotope shifts and hyperfine splittings shown in 
Fig. 6.12. To calibrate the laser scan, experimenters send part of the 
laser beam through a Fabry-Perot étalon and record the transmission, as 
shown in Fig. 8.12 (cf. Fig. 8.6(a)). The observed fringes have a spacing 
equal to the étalon’s free-spectral range of c/2/ and 1, the length of the 
cavity, can be measured accurately. In practice these experiments do 
need some method that gives the approximate wavelengths of the laser 
light, in order to find the atomic lines. 
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1. Spectrum to be calibrated 
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Fig. 8.12 Calibration of a laser experiment. Three signals are recorded: (1) the spectrum to be calibrated, e.g. the absorption 
of an atomic vapour; (2) a molecular spectrum, e.g. the absorption spectrum of iodine; and (3) the intensity transmitted 
through a Fabry Perot étalon gives fringes with a frequency spacing equal to its free-spectral range c/2l, where l is the length 
of the étalon. These reference fringes provide the frequency scale. Molecular spectra have a ‘forest’ of lines so that there will 
be lines near any arbitrary wavelength and the individual lines can be identified by comparison with a known spectrum. These 
molecular lines give the absolute frequency. (The sodium cell is heated to give a vapour pressure sufficient for an absorption 


experiment.) 


8.5.2 Absolute calibration 


To determine the absolute frequency of a spectral line it is compared to 
a nearby line of known frequency (or wavelength)—the same principle 
as in the use of a calibration lamp to produce fiducial lines on a spec- 
trum obtained from a conventional prism spectrograph (or diffraction 
grating). Laser spectroscopists often use iodine to provide the reference 
lines because its molecular spectrum has many lines in the visible region 
and an atlas of iodine wavelengths has been compiled for this purpose 
(Gerstenkorn et al. 1993). Molecules have many more transitions than 
atoms and this gives a high probability of finding a suitable line near any 
frequency of interest. Iodine has sufficient vapour pressure at room tem- 
perature to give measurable absorption in a simple glass cell as shown in 
Fig. 8.12. The figure uses the Doppler-broadened absorption in sodium 
as an example of the spectrum to be measured. The iodine lines have 
much narrower Doppler widths because of the heavy molecular mass (Ip 
has molecular weight 254). 

The calibration of Doppler-free spectra often requires narrower refer- 
ence lines obtained by saturation spectroscopy on the iodine itself (see 
Corney 2000, Figs 13.13 and 13.14). The frequency of the 1s—2s tran- 
sition in atomic hydrogen described in Example 8.3 has been measured 
relative to a line in the saturated absorption spectrum of tellurium,”? 
as shown in Fig. 8.13. The experiment was calibrated by the following 
procedure. The saturation spectroscopy of Tez was carried out with 
blue light of wavelength 486nm and angular frequency wi. Some of 


sia heavy diatomic molecule like io- 
dine, but Tez happens to have lines in 
the blue region whereas Ig does not. 
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Fig. 8.13 (a) A two-photon spectrum 
of the 1s-2s transition in atomic hy- 
drogen as in Fig. 8.11 but on a differ- 
ent scale. (b) The saturated absorp- 
tion spectrum of molecular tellurium 
used for calibration. The absolute fre- 
quency of the line labelled 7 was deter- 
mined with an uncertainty of 6 x 10~!° 
(by auxiliary measurements). Adapted 
from McIntyre et al. (1989). Copyright 
1989 by the American Physical Society. 


(a) Hydrogen 1s—2s 
(243 nm) 
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this blue light was passed through a nonlinear crystal where the process 
of second-harmonic generation produced some radiation of frequency 
w = 2wy. The frequency of this ultraviolet radiation at 243nm was 
compared with the radiation (with a very similar frequency) that ex- 
cited the 1s -2s transition. Thus the two-photon resonance condition in 
eqn 8.20 is wig = 2w = 4uy. The 1s 7S)/2 F = 1 to 2s *Si72 F =1 
transition has almost exactly four times the frequency of the line 7 in 
the spectrum of Tez, and the small frequency offset can be measured 
precisely. 

This method of calibration in terms of known spectral lines begs the 
question of how to determine the frequencies of the reference lines them- 
selves in the first place. The short answer is that experimenters rely on 
the national standards laboratories around the world to measure suit- 
able reference lines and to establish internationally agreed frequency 
standards, e.g. the particular iodine line that coincides with the out- 
put of the He-Ne laser at 633nm has been measured very accurately. A 
helium—neon laser with its frequency controlled to be equal to that of the 
iodine line provides a portable frequency standard, i.e. one calibrated by 
the standard laboratory and then carried to the experimental laboratory 
to provide a reference (see Corney 2000, Section 13.10). The national 
standards laboratories must calibrate the secondary frequency standards 
in terms of the primary standard of time provided by the caesium atom 
clock at a frequency of 9GHz (as described in Chapter 6). Until re- 
cently, a frequency chain was required to relate an optical frequency to 
a microwave frequency standard. A frequency chain comprises many 
oscillators, such as microwave sources and lasers, whose frequencies are 
multiples of each other, as indicated in Fig. 8.14. To go from 9 GHz 
up to around 6 x 104 Hz (corresponding to visible light) required many 
different oscillators as links in the chain. All these devices must operate 
simultaneously and have their frequencies electronically controlled rela- 
tive to those of neighbouring oscillators this makes such high precision 
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measurements are a major undertaking. Recently, a new method has 
been invented that supersedes cumbersome frequency chains and makes 
the measurement of optical frequencies more straightforward. 


8.5.3 Optical frequency combs 


Recently, a new method of measuring optical frequencies has been in- 
vented that has revolutionised optical metrology. The new method re- 
lies on the ability to generate frequency combs using laser techniques, 
i.e. laser radiation that contains a set of regularly-spaced frequencies, as 
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Fig. 8.14 A frequency chain. The fre- 
quency of a line in the visible spectrum 
of molecular iodine is measured by set- 
ting up a chain of oscillators. The fre- 
quency of each oscillator is compared 
to a multiple or sum of the frequencies 
of the other oscillators. At the bot- 
tom of the chain is a microwave source 
whose frequency is measured with ref- 
erence to a caesium frequency stan- 
dard (the primary time standard as de- 
scribed in Section 6.4.2). A diode mixer 
produces a high number of harmonics 
of the microwaves, so that the first: few 
stages achieve multiplication by a fac- 
tor of 7. Nonlinear crystals are used for 
mixing mid-infra-red and visible radi- 
ation. The scheme shown here is one 
of the least complicated but it still in- 
volves a large number of devices. When 
the whole chain is operating it deter- 
mines the multiplication factor that re- 
lates the optical frequency to the cae- 
sium frequency standard at 9GHz. Af- 
ter Jennings et al. (1979). 
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Fig. 8.15 A frequency comb produced by a femtosecond laser system (and an optical fibre)—in reality there are many more 
regularly-spaced modes than are shown here. The frequencies spread over an octave so that. modes on the low- and high- 
frequency wings can be compared, using second-harmonic generation in a nonlinear crystal, to determine the frequency offset 


of the comb as in eqn 8.25. 


304 very short pulse of light propa- 
gates around the optical cavity of the 
laser formed by high-reflectivity mir- 
rors. One of these mirrors is less re- 
flective than the others so that it trans- 
mits a few per cent of the light. Each 
time the short pulse hits this output 
coupling mirror part of the pulse tray- 
els out of the cavity to give a steady 
train of short pulses that emerge from 
the laser with a time interval of trep be- 
tween them. This time interval between 
pulses equals the round trip length of 
the laser cavity L divided by the speed 
of light: trep = L/e. 


31 This behaviour is closely analogous 
to the situation of light reflected from 
a diffraction grating, where the angu- 
lar separation of the diffraction orders 
is inversely proportional to the spacing 
between the rulings, or slits for a trans- 
mission grating. For a detailed descrip- 
tion of Fourier transforms and diffrac- 
tion gratings see Brooker (2003). 


illustrated in Fig. 8.15. The frequency comb contains the frequencies 


f=fo + nfrep ; 


where frep is the frequency interval and fo is an offset from zero that 
we shall assume is smaller than frep (for this choice n will be a large 
positive integer). A laser produces such a frequency spectrum; however, 
there is not room here for a detailed description of the laser physics 
of such systems, see Davis (1996) or Meschede (2004).°° The frequency 
spectrum is related by a Fourier transform to a train of short pulses in 
the time domain; the time interval between pulses trep and the spacing 
in the frequency are related by frep = 1/trep.?* 

The frequency span of the comb, i.e. the width of the spectrum’s 
envelope in Fig. 8.15, is inversely proportional to the duration of each 
individual pulse. Actually, in the historical development of pulsed lasers 
this was viewed the other way around—the objective was to create the 
shortest pulses possible and this requires the laser medium to have gain 
over a wide spectral region. The titanium-doped sapphire laser used 
in the frequency comb experiments created pulses with a duration less 
than 100fs (< 107!%s, see Holzwarth et al. (2000)). Such femtosecond 
lasers have great technical importance, both for studying processes with 


(8.24) 
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very high time resolution and for creating pulses of extremely high peak 
intensity (by compressing a high-energy pulse into a very short time). 

The frequency frep is measured by directing some of the laser light 
onto a photodiode (see Fig. 8.16). The femtosecond laser does not pro- 
duce a sufficiently wide spread in frequencies for the specific scheme 
illustrated in Figs 8.15 and 8.16. This is obtained by sending the output 
of the femtosecond laser along a special highly-dispersive optical fibre, as 
indicated schematically in Fig. 8.16. The combination of the femtosec- 
ond laser and this special fibre produces radiation with the frequency 
spectrum in eqn 8.24, that spans a large spectral range, e.g. from 520 nm 
to 1170nm in the work of Udem et al. (2001), equivalent to a frequency 
range of 300 THz. For frep = 1 GHz this range of values corresponds to 
f= 251016 = 10°. 

After the frequency comb has been generated, the next stage is to 
determine the offset fo by an ingenious method devised by Professor 
Theodor Hansch and co-workers, namely a comparison, or self-referenc- 
ing, of the frequency of lines from different parts of the frequency comb. 
This is achieved by sending the light (from the low-frequency wing of 
the comb) through a frequency-doubling crystal; in this nonlinear opti- 
cal medium some radiation is generated at the second harmonic of the 
input frequency. The light emerging from the crystal has frequency com- 
ponents 2(n’ frep + fo), where n’ is an integer (the reason for introducing 
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Fig. 8.16 The experimental arrangement for the measurement of an optical frequency using a frequency comb from a fem- 
tosecond laser. Photodiode 1 measures the frequency interval between the laser modes, frep in eqn 8.24, and frep is maintained 
constant by electronic feedback from the frequency counter to the laser (the laser cavity length is kept fixed by adjusting the 
position of a mirror with a piezoelectric actuator). Photodiode 2 measures the beat frequency between modes in the low- and 
high-frequency wings of the comb (see Fig. 8.15). Photodiode 3 measures the frequency difference between one mode of the 
comb and the unknown laser frequency as in eqn 8.26. The diffraction gratings spread out the light so that only the relevant 
part of the frequency comb falls onto the detector, as explained in the text. Figure courtesy of Dr Helen Margolis, National 
Physical Laboratory; after Margolis et al. (2003). 


Diffraction 
grating 


PDLY | y 


174 Doppler-free laser spectroscopy 


320ther schemes have been demon- 
strated, e.g. choosing 2n = 3n’ so that 
the frequency comb does not have to be 
so wide and can be generated directly 
from a laser (eliminating the optical fi- 
bre in Fig. 8.16). 


33.4 diffraction grating is used to 
spread out the light at different wave- 
lengths so that only the high-frequency 
part of the spectral region where n = 
2n’ falls onto the detector. Light at 
other wavelengths produces unwanted 
background intensity that does not con- 
tribute to the signal. 


34-The beat frequencies with other com- 
ponents fall outside the bandwidth of 
the detector. 


35This method has also been used to 
calibrate a selection of molecular io- 
dine lines that can be used as secondary 
frequency standards, as described in 
the previous section (Holzwarth e¢ al. 
2000). 


both n! and n will become apparent shortly). This radiation mixes with 
some of the original light, whose frequency is given by eqn 8.24, ona | 
photodiode. The signal from this detector contains the frequencies 

f =2(n'frep + fo) — (tfrep + fo) = (2n' —n)frep + fo- (8.25) 
For a frequency comb that spans an octave there are frequency com- 
ponents with n = 2n’, i.e. high-frequency lines that have twice the fre- 
quency of lines on the low-frequency wing. For these lines eqn 8.25 
reduces to fo, and in this way the frequency offset is measured.*? In 
Fig. 8.16 the photodiodes 1 and 2 measure precisely the radio frequen- 
cies frep and fo, respectively, and hence determine the frequency of each 
line in the comb (eqn 8.24).3% 

The light from the calibrated frequency comb is mixed with some of 
the output of the continuous-wave laser whose frequency fp is to be: 
measured, whilst the remaining light from this second laser is used for 
experiments, e.g. high-resolution spectroscopy of atoms or molecules. 
The third photodiode measures the beat frequency, which is equal to 
the difference between f, and the nearest component of the frequency 
comb:34 


Foeat = it Sees a ee fil . (8.26) 


This beat frequency is measured by a radio-frequency counter. 

The unknown laser frequency is determined in terms of the three mea- 
sured frequencies as fp = n” frep + fo + foeat- It is assumed that fy is 
known with an uncertainty less than frep, so that the value of the integer 
n" is determined, e.g. when frep = 1 GHz (as above) and fy, ~ 5 x 10'4 
(corresponding to a visible wavelength) it is necessary to know fy to a 
precision greater than 2 parts in 10®, which is readily achieved by other 
methods. The measurement of radio frequencies can be carried out ex- 
tremely accurately and this frequency comb method has been used to 
determine the absolute frequency of very narrow transitions in atoms 
and ions,®° e.g. Ca, Hg*, Sr+ and Yb+—see Udem et al. (2001), Blythe 
et al. (2003) and Margolis et al. (2003). These experiments were limited 
by systematic effects such as perturbing electric and magnetic fields, 
that can be improved by further work. The uniformity of the spacing 
of the lines in the frequency comb has been verified to at least a few | 
parts in 10'°, and in the future it is anticipated that uncertainties in 
measurements of frequency of very narrow transitions in ions, trapped 
using the techniques described in Chapter 12, can be reduced to a few 
parts in 10'8. At such an incredible level of precision new physical ef- 
fects may show up. For example, it has been suggested that fundamen- 
tal ‘constants’ such as the fine-structure constant a may vary slowly on 
astrophysical time-scales, and this would lead to changes in atomic tran- 
sition frequencies with time. If such variations do indeed occur, then an 
inter-comparison of frequency standards that depend on different powers 
of @ over many years is potentially a way to observe them. 
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Further reading 


This chapter has focused on just a few examples of Doppler-free laser 
spectroscopy and calibration to illustrate the important principles. Such 
measurements of the transition frequencies in atomic hydrogen give a 
precise value for the Rydberg constant and the QED shift. More de- 
tails of the saturated absorption and two-photon methods are given 
in specialised books on laser spectroscopy by Letokhov and Chebotaev 
(1977), Demtréder (1996), Corney (2000) and Meschede (2004)—the 
last of these gives details of recent experiments on the 1s—2s transition 
in atomic hydrogen. Nowadays, laser spectroscopy is very widely used 
in more complex situations, e.g. liquids and solids. 

The monograph by Series (1988) on the spectrum of atomic hydrogen 
gives a comprehensive description that includes Lamb and Retherford’s 
historic experiment and later refinements of the radio-frequency tech- 
niques, as well as laser spectroscopy. The series of proceedings of the 
biennial International Conference on Laser Spectroscopy (published by 
Springer-Verlag) gives an overview of the current state of the art and 
current applications. The measurement of the absolute frequency of light 
using optical frequency combs is a relatively new technique but already 
it has had an important impact on optical frequency metrology (Udem 
eal. 2002). 
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Exercises 
(8.1) Doppler widths produce marker fringes with a frequency spacing of 
Calculate the Doppler width of a spectral line with 75 MHz. 


a wavelength of 589nm for (a) sodium vapour at 
1000K, and (b) a vapour of molecular iodine at 
room temperature. 


(8.2) Doppler broadening 
The two fine-structure components of the 2s—2p 
transition in a lithium atom have wavelengths of 
670.961 nm and 670.976nm (in a vacuum). Esti- 
mate the Doppler broadening of this line in a room- 
temperature vapour and comment on the feasibility 
of observing the weak-field Zeeman effect in lithium. 


Intensity transmitted © 
by étalon 
(marker fringes) 


Frequency 


am 
oy 
— 


(8.3) Crossed-beam technique 

The figure shows the fluorescent signal obtained 
from a crossed-beam experiment like that presented 
in Fig. 8.2. Radiation of wavelength 243 nm excited 
a single-photon transition in strontium atoms from 
an oven at a temperature of 900 K. Each peak is la- 
belled with the relative atomic mass of the isotope. 
The frequency scale was calibrated by sending some Frequency 
light through a Fabry—Perot étalon (cf. Fig. 8.12) to 


Intensity of 
fluorescence 


86 87 88 
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(8.4) 


Determine the line width of the peaks and the fre- 
quency shift between the even isotopes from the 
scan. The line width arises from residual Doppler 
broadening. Calculate the collimation angle of the 
atomic beam. 


Comment. This line has a normal mass shift of 
180 MHz between the two even isotopes. There are 
smaller contributions to the isotope shift from the 
specific mass and volume effects. 


Hyperfine structure in laser spectroscopy 

What is the physical origin of the interaction that 
leads to hyperfine structure in atoms? 

Show that hyperfine splittings obey an interval rule 
which can be expressed as 


AEpp-i = Ang, 


i.e. the splitting of two sub-levels is proportional to 
the total angular momentum quantum number F 
of the sub-level with the larger F. 

The naturally-occurring isotope of caesium (*°°Cs) 
has a nuclear spin of J = 7/2. Draw a diagram 
showing the hyperfine sub-levels, labelled by the ap- 
propriate quantum number(s), that arise from the 
654 /2 and 6 [Pp /2 levels in caesium, and the al- 
lowed electric dipole transitions between them. 
Explain the principle of Doppler-free saturation 
spectroscopy. 

The figure shows the saturated absorption spectrum 
obtained from the 6 *S1/2-6 *P3/2 transition in a 
vapour of atomic caesium, including the cross-over 
resonances which occur midway between all pairs of 
transitions whose frequency separation is less than 
the Doppler width. The relative positions of the 
saturated absorption peaks within each group are 
given below in MHz. 


%, 


Probe beam intensity 


A B C 1) E i 
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Using these data and the information in the dia- 
gram: 


(a) determine the extent to which the interval rule 
is obeyed in this case and deduce the hyper- 
fine parameter A,,;; for the 6 2 Sip and 6 P32 


levels; 

(b) estimate the temperature of the caesium 
vapour. (The wavelength of the transition is 
852 nm.) 


Hyperfine structure in laser spectroscopy 

The energy separation between the two hyperfine 
levels in the ns configurations of hydrogen is given 
by eqn 6.10, and for n = 1 this corresponds to 
a hyperfine transition frequency of Afyrs(1s) = 
1.4 GHz. 


(a) Determine the separation of the hyperfine sub- 
levels in the 2s 7S,/2 level of hydrogen, and 
compare your answer to the value in the cap- 
tion of Fig. 8.7. 


Show that the peaks utes aie in Fig. 8.11 have 
an expected separation of A furs(1s). Com- 
pare the expected value win that in the figure 
(e.g. by measurement with a ruler and using 
the frequency scale given). 


(b) 


i en \gil” 
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Frequency 
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(8.6) 


(8.7) 


Two-photon experiment 


Intensity 


0 200 


400 
Relative frequency of radiation (MHz) 


600 800 1000 


The above experimental scan comes from a two- 
photon experiment like that shown in Fig. 8.8. The 
transition from the 5p® ‘So ground level of xenon 
to a J = 0 level of the 5p°6p configuration was ex- 
cited by ultraviolet radiation with a wavelength of 
249 nm and the scale gives the (relative) frequency 
of this radiation. This J = 0 to J’ = 0 transition 
has no hyperfine structure and the peak for each iso- 
tope is labelled with its relative atomic mass. The 
xenon gas was at room temperature and a pressure 
of 0.3 mbar. Light from a blue dye laser with a 
frequency jitter of 1 MHz was frequency-doubled to 
generate the ultraviolet radiation and the counter- 
propagating beams of this radiation had a radius of 
0.1 mm in the interaction region. 

Estimate the contributions to the line width from 
(a) the transit time, (b) pressure broadening, (c) 
the instrumental width, and (d) the Doppler effect. 


Collision broadening of a two-photon transition 
The signal shown in Fig. 8.11 has a line width 
(FWHM) of about 10MHz. From the data given 
in Example 8.3 determine the maximum pressure 
of hydrogen which could have been used in that ex- 
periment. 


36See Boshier et al. (1989) and McIntyre et al. (1989). 


(8.8) 
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Later experiments*® measured the pressure broad- 
ening of the 1s—2s transition frequency to be 
20 GHz/bar for hydrogen atoms in a gas that is 
mostly helium atoms. Estimate the cross-section 
for collisions between metastable hydrogen and he- 
lium atoms. Comment on the size of this cross- 
section in relation to the size of atoms. 


Convolution of Lorentzian line shapes 

A simple quantitative model of saturated absorp- 
tion spectroscopy is given in Appendix D and this 
exercise examines some of the mathematical de- 
tails. 


(a) The convolution of two Lorentzian functions of 
equal width can be found using 


Tv 
L+y? 
(8.27) 
Calculate the integral in eqn D.6. Hence prove 
eqn D.7. 


The convolution of two Lorentzian functions of 
unequal widths is 


[- ee a ee 
-oo 1+ (2y—2)?1+2? 2 


(b) 


[- 1 1 
-o a? +(y+r) b?+(y—2) 
= (2 + *) 9 
ab J (2y)? + (a+b)? 
(8.28) 
Use this to show that taking into account the 
power broadening of the hole burnt in popula- 


tions by the pump beam leads to a predicted 
line width in saturation spectroscopy of 


I 
Isat ; 


Comment. The proof of eqns 8.27 and 8.28 re- 
quires the residue theorem for complex path in- 
tegrals. 
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The inhomogeneous magnetic field de- 
flects the atoms in the Stern—Gerlach 
experiment but has a negligible effect 
on the speed. 


2 This condition does not hold true for 
cold atoms moving through a standing 
wave of light where the intensity varies 
significantly over short distances (com- 
parable with the optical wavelength, 
see Section 9.7). 


3Chu and co-workers also developed a 
physically equivalent description. 


Laser cooling and trapping 


In previous chapters we have seen how laser spectroscopy gives Doppler- 
free spectra and also how other older techniques of radio-frequency and 
microwave spectroscopy can resolve small splittings, e.g. hyperfine struc- 
ture. These methods just observe the atoms as they go past,’ but this 
chapter describes the experimental techniques that. use the ee exerted 


many a net e.g. they have Sein. improved the stability of the 
caesium atomic clocks that are used as primary standards of time around 
the world. We shall look at the forces that laser light exerts on an atom 
in some detail since this aspect contains most of the atomic physics. In 
many of the cases studied in this chapter, the atom’s motion follows 
straightforwardly from Newton’s laws once the force is known—an atom 
behaves like a classical particle, localised at a particular point in space, 
when the atomic wavepacket has a spread which is small compared to 
the distance over which the potential energy varies.” 

The first laser cooling experiments were carried out on ions that were 
trapped by electric fields and then cooled by laser radiation. “In contrast.” 
it is difficult to confine atoms at room temperature, or above. because of 
the smaller electromagnetic forces on neutral particles. “Therefore the pt 
oneéring experiments used light, for ces to slow atoms in an atomic beam ~ 
and then confined the cold atoms with a magnetic field. The great s suc- 
cess of laser cooling led to the award of the 1997 Nobel prize in physics 
to Steven Chu, Claude Cohen- Tannoudji and William Phillips. To de- 


scribe | the development ‘of the sumone we consider their contributions 


on atoms in terms of sie — of eeiem The research group of 


Phillips used this force to slow an atomic beam ' (Section 9.2); Chu and _ 


co-workers then demonstrated the method known as the optical molasses 
technique, that cools the motion of. atoms in all three dimetisions to give~ 
a very cold atomic vapour | r (Section 9.3).; This, led directly _to the devel-” 
opment. of the so-called magneto- -optical trap \Section 9.4) used in the 
majority of atom- trapping expenmentsioday "acme 
The interaction of the atoms with the light field turned out to be 
much more subtle than first supposed, and experiments showed that the 
optical molasses technique produced even lower temperatures than pre- 
dicted. Cohen- Tannoudji and Jean Dalibard explained this behaviour 
by a new mechanism called Sisyphus cooling.* This mechanism is de-~ 


scribed towards the end of the chapter (Section 9.7) since it does not 


fali neatly into either category of the radiation force, i.e. (a) the scatter- 
ing force that arises from absorption of light and spontaneous emission, 
and (b) the dipole force, described in Section 9.6. The forces on micro- 
scopic particles have properties similar to the forces on individual atoms, 
and this analogy is used to introduce the dipole force in Section 9.5. 
Atoms that have been pre-cooled by the other laser cooling techniques 
described in this chapter can be cooled even further using Raman tran- 
sitions (Section 9.8). Finally, Section 9.9 describes an atomic fountain 
which requires laser-cooled atoms. 


9.1 The scattering force 


The idea that radiation has momentum (and energy) goes back to James 
Clerk Maxwell in the nineteenth century. It follows from the conserva- 
tion of momentum that when an object absorbs radiation its momentum 
changes. The force on the object equals the rate of change of momen- 
tum. Therefore the force equals the rate at which the light delivers 
momentum—this is the same as the rate at which the light delivers en- 
ergy divided by the speed of light.4 Therefore radiation of intensity J 
exerts a force on area A given by 


IA 
Fad = . 
1G 


(9.1) 
Equivalently, the radiation pressure is Frag/A = I/c. The quantity 
IA equals the power absorbed, e.g. for JA = 1W the force is F = 
3.3 x 10-9N. At a surface that reflects the radiation back on itself the 
momentum change is twice as large and gives twice the force in eqn 9.1. 
Although small, the radiation force has observable effects in astrophysics, 
e.g. the outward radiation pressure balances gravity in stars, and the 
tails of comets point away from the sun (rather than trailing behind 
as for shooting stars in the atmosphere). Note, however, that although 
radiation pressure does have some effect on the dust and ice particles that 
form the tail of a comet, the solar wind is also important—the stream of 
particles emanating from the sun hit the particles in the comet tail and 
even the relatively low pressure in space leads to a force comparable to 
that from radiation pressure.° Radiation forces have a dramatic effect 
on atoms because the peak absorption cross-section daps(Wo) is much 
greater than the physical size of the atom (see eqn 7.81).° 

Lasers produce well-collimated monochromatic beams of light that 
can slow atoms in an atomic beam, as illustrated in Fig. 9.1. A counter- 
propagating laser beam exerts a force of F = —oapsI/c on an atom, 
where the minus sign indicates a force in the opposite direction to the 
motion. This expression in terms of the absorption cross-section shows 
that the light does not have to be considered as quantised in order to 
calculate the force, but it is convenient to describe the processes in 
terms of photons. Each absorbed photon gives the atom a kick in the 
direction opposite to its motion and spontaneously-emitted photons go 
in all directions, so that the scattering of many photons gives an average 
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4The energy of radiation divided by its 
momentum equals c. For a photon 
energy fw w ae 
momentum Ak k  — 
Of course, Maxwell showed this by clas- 
sical electromagnetism, not in terms of 
photons. (The ratio does not depend on 
h.) This and other aspects of the pres- 
sure due to electromagnetic radiation 
are discussed in Bleaney and Bleaney 
(1976, Section 8.8). 


>The radiation from the sun has an 
intensity of 1.4kWm7~? at the Earth. 
Thus the radiation pressure at the 
Earth’s orbit is 5 x 10-°Nm7?, or 
slightly less than ~ 10~1° times atmo- 
spheric pressure. 


6 Alkali atoms in a vapour have a large 
fraction of their absorption strength 
concentrated in a narrow range centred 
at the frequency of the resonance line. 
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Fig. 9.1 For an atom moving towards 
the laser, each absorbed photon gives 
the atom a kick in the direction oppo- 
site to its motion and the scattered pho- 
tons go in random directions, resulting 
in a force that slows the atom. 


7 Generally speaking, intensity is more 
directly related to experimental param- 
eters than the Rabi frequency, but we 
shall use both J and © in this chapter. 
As noted previously, there are other 
definitions of the saturation intensity in 
common use that differ by a factor of 2 
from the one used here. 


8This statement relies on the compre- 
hensive description of two-level atoms 
interacting with radiation given in 
Chapter 7. 


Oven Laser 


m———_ 
Fecatt 


force that slows the atom down. The magnitude of this scattering force 
equals the rate at which the absorbed photons impart momentum to the 
atom: 


(9.2) 
The scattering rate is Rscatt = p22, and p22, the fraction of the popu- 
lation in level 2, is given in eqn 7.69, so that 

Ip 0772 
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Fecatt = (photon momentum) x (scattering rate) . 


Rscatt = (9.3) 
The frequency detuning from resonance 6 = w—wo+kv equals the differ- 
ence between the laser frequency w and the atomic resonance frequency 
wo taking into account the Doppler shift kv. The Rabi frequency and 
saturation intensity are related by I / Isat = 2?/T? (see eqn 7.86)’ and 
photons have momentum /ik, so that® 


ih Ll Tee 
- 2 
seatt MR TF [Tanz + 202 /E? 


As I + o the force tends to a limiting value of Fnax = AkT/2. The 
rate of spontaneous emission from two-level atoms tends to ['/2 at high 
intensities because the populations in the upper and lower levels both 
approach 1/2. This follows from Einstein’s equations for radiation inter- 
acting with a two-level atom that has degeneracy factors g; = g2 = 1. 

For an atom of mass M this radiation force produces a maximum 
acceleration that we can write in various forms as 


Once 

OM Aes 
where 7 is the lifetime of the excited state. The recoil velocity v, is the 
change in the atom’s velocity for absorption, or emission, of a photon 
at wavelength A; it equals the photon momentum divided by the atomic 
mass: v, = hk/M =h/(AM). For a sodium atom amax = 9 x 10° ms~?, 
which is 10° times the gravitational acceleration. For the situation shown 
in Fig. 9.1 the atom decelerates at a rate 


(9.4) 


(9.5) 


(9.6) 


where a is positive. For constant deceleration from an initial velocity vo 
at z = 0, integration gives the velocity as a function of distance: 


vp? — v? = 2az. (9.7) 


Typically, the deceleration is half the maximum value a = dmax/2 to 
ensure that atoms are not left behind.? Hence the stopping distance is 


Vo" 


eS (9.8) 


Qmax 


A typical apparatus for slowing a sodium atom (M ~ 23a.m.u.) uses the 
parameters in the following table, where the initial velocity vg is taken 
to be the most probable velocity in a beam (given in Table 8.1). 


Most probable velocity in beam (T’ = 900 K) Vo 1000 ms7* 
Resonance wavelength Xr 589 nm 
Lifetime of the excited state 5p 16 ns 
Recoil velocity vr =h/(AM) 3cms' 


Stopping distance (at half of 


ie 
the maximum deceleration) 2v07 / vr 1.1m 


A distance of 1m is a convenient length for an experiment, and all 
alkali metals require surprisingly similar stopping distances. Although 
the heavier elements have a lower deceleration, they also have lower 
initial velocities because of their higher mass and the lower temperature 
required to give sufficient vapour pressure in an oven for a high-flux 
atomic beam, e.g. eqn 9.8 gives Lo = 1.2m for rubidium and a most 
probable velocity of v9 = Vpeam = 360ms—' at T = 450K (from the 
data in Table 9.1). 

The majority of laser cooling experiments have been carried out with 
sodium or rubidium.'° 

The calculation of the stopping distance assumes a constant decelera- 
tion, but for a given laser frequency atoms only experience a strong force 
over a narrow range of velocities, Av ~ ['/k, for which the atoms have a 


Table 9.1 Properties of some elements used in laser cooling experiments. 


Element Atomic mass Wavelength of Lifetime of the 


number resonance (nm) excited state (ns) 
H 1 PALL 1.6 
Li ff 671 7aTl 
Na 23 589 16 
K 39 767 26 
Rb 85, 87 780 27 


Cs 133 852 ail 
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® Fluctuations of the force about its av- 
erage value arise from the randomness 
in the number of photons scattered per 
unit time, as described in Section 9.3.1. 


!0Other elements such as magnesium 
(an alkaline earth metal) have ultravi- 
olet transitions with shorter lifetimes 
that scatter photons of higher momen- 
tum. Thus the atoms stop in a shorter 
distance; however, it is technically more 
difficult to obtain continuous-wave ul- 
traviolet radiation. 
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‘IThis range is similar to the width of 
the hole burnt in the ground-state pop- 
ulation in saturated absorption spectro- 
scopy. In practice, power broadening 
makes the homogeneous width larger 
than the natural width. 


Fig. 9.2 The first Zeeman slowing ex- 
periment. The solenoid produces a 
magnetic field that varies with posi- 
tion along the atomic beam so that 
the Zeeman shift compensates for the 
change in the Doppler shift as atoms 
decelerate. A probe laser beam inter- 
sects the slow atomic beam at a point 
downstream and the laser frequency is 
scanned to give a fluorescent signal pro- 
portional to the velocity distribution, 
similar to that shown in Fig. 9.4 (from 
a different experiment). This procedure 
records the component of the atomic 
velocity along the probe beam and the 
angle of intersection must not be 90°. 
(Here we have the opposite requirement 
to that in Fig. 8.2.) Adapted from 
Phillips et al. (1985). 


range of Doppler shifts approximately equal to the homogeneous width 
of the transition.!! Atoms that interact strongly with the laser light slow 
down until the change in their Doppler shift takes them out of resonance 
with the light. This change must be compensated for in order to keep 
the force close to its maximum value throughout the slowing process. 


9.2 Slowing an atomic beam 


The two pioneering laser cooling experiments used different methods to 
compensate for the change in Doppler shift as the atoms slowed down. 
William Phillips and co-workers used the ingenious method shown in 
Fig. 9.2 in which the atomic beam travels along the axis of a tapered 
solenoid; the Zeeman effect of the varying magnetic field perturbs the 
atomic energy levels so that the transition frequency matches a constant 
laser frequency. In the other method the laser frequency was changed 
and this so-called chirp cooling is described in the next section. From 
eqns 9.7 and 9.8 we see that during constant deceleration the velocity 
at distance z from the starting point is given by 


(9.9) 


To compensate for the change in Doppler shift as the atoms slow down 
from vo to the chosen final velocity, the frequency shift caused by the 
Zeeman effect needs to obey the condition 

Blz 

Wo + suite) =w+kv. 

h 
On the left-hand side, the Zeeman shift for an atomic magnetic moment 
LB increases the atomic resonance frequency from wo, its value at zero 


(9.10) 


Detector 4 Collection 


optics 
4 Probe laser 
: beam 
Cooling laser 


ae beam 
Observation 


region 


Varying-field 
solenoid 


Na 
source 


Cc 


ae Ll ne 


field; on the other side of the equation, the Doppler shift adds to the 


laser frequency w. Hence we find from eqn 9.9 that the required magnetic 
field profile is 


1/2 

z 
B(z) = Bo (: = *) + Boias (9.11) 

Lo 

for 0 < z < Lo, where 

huo 
Bo = —. 2 
o=32 (9.12) 


If upp Boias ~ hw—hwo then the atoms come to a complete stop at the end 
of the tapered solenoid; generally, it is more useful to leave the atoms 
with a small velocity so that they travel out of the tapered solenoid 
to a region where experiments, or further cooling, can be performed. 
Figure 9.3(a) shows the field profile for w ~ wo and Bpias ~ 0, so that the 
maximum field at the entrance to the solenoid is about Bo; Fig. 9.3(b) 
shows the field profile for a different choice of Bpias that requires a lower 
magnitude of the field. 


Example 9.1 For a transition between a state with quantum num- 
bers F and Mp and an excited state with quantum numbers F” and 
Mr, the Zeeman effect causes an (angular) frequency shift of (gp Mr — 


(a) B(z) 


Bo 


XR 


Se 


(b) B(z) 
Bo — 
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Fig. 9.3 The magnetic field along the 
tapered solenoid in the first Zeeman 
slowing experiment (see Fig. 9.2) var- 
ied with position as shown in (a). This 
magnetic field is described by eqn 9.11. 
Nowadays, some experiments use the 
variant shown in (b) where the field 
drops to zero and then reverses; this 
gives the same decrease in velocity for 
a given change in field Bo, but has the 
following three advantages: (i) the field 
has a lower maximum value so that the 
coils need less current-turns; (ii) the ‘re- 
verse slower’ produces less field at po- 
sitions downstream, z > Lo, because 
the contributions from the coils with 
currents in opposite directions tend to 
cancel out; and (iii) the abrupt change 
in the field at the exit helps the atoms 
to leave the solenoid cleanly (the atoms 
see a sudden increase in the frequency 
detuning of the light from resonance 
that cuts off the radiation force). In 
a real solenoid the field changes gradu- 
ally, as illustrated in (c). This smooth- 
ing has little influence on the overall 
length required because it does not sig- 
nificantly affect the early part of the 
slowing process. 
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120Other alkalis have corresponding 
transitions between ‘fully-stretched’ 
states for which F has the maximum 
value for a given J and J; and Mp = F 
or Mr = —F. Sodium has a nuclear 
Spine ele—13/2) 


13 From eqn 9.10 we see that the radia- 
tion exerts the strongest force on atoms 
with a velocity vg at the end of the 
solenoid that is given by 


kug ~ wo + (9.14) 


UBBoias _ 

h 
The actual final velocity will be lower 
than vs because the atoms remain in 
the laser light after they have emerged 
from the solenoid and are slowed fur- 
ther. A rough estimate suggests that 
the atomic velocity is lower than vs by 
an amount corresponding to a Doppler 
shift of several line widths (e.g. 3I'/k), 
but this depends on the interaction 
time with the light (which itself de- 
pends on the velocity and distance trav- 
elled). It is difficult to obtain very 
low final velocities without stopping the 
atoms completely and pushing them 
back into the solenoid. 


gr Mr)ppB/h. For the 3p ?P3/2—3s 7S; /2 line in sodium, the transition 
between the hyperfine levels F’ = 3, Mp = 3 and F = 2, Mr = 2 has 
gr Mr: —grMr = 1, s0 its Zeeman shift is as assumed in eqn 9.10. This 
transition leads to a closed cycle of absorption and spontaneous emis- 
sion because selection rules dictate that the excited state can only decay 
back to the initial state. This transition was used in the first slowing 
experiment shown in Fig. 9.2.!* Hence for vp = 1000ms~! we find from 
eqn 9.12 that 


By = 0.12T, (9.13) 


which is well within the capability of standard magnet coils. In this 
magnetic field a sodium atom has a Zeeman shift equal to the Doppler 
shift of Af = vo/A = 1.7GHz (cf. the natural width [/27 = 10 MHz). 


The important feature of the Zeeman slowing technique is that it 
reduces the velocity of a large fraction of the atoms in a beam to a low 
final value ve. Any atoms that start with velocities within the range 
Uo to ve interact with the laser radiation at some position along the 
solenoid and are swept along in the slowing process. The calculations 
in this section show how this works in principle, but the equations do 
not give the value of the final velocity vp accurately for the following 
reason. The stopping distance is proportional to the square of the initial 
velocity (eqn 9.8), so during deceleration from vp = 100ms~! to vp = 
Oms‘ the atoms only move 1 cm; and deceleration from v9 = 33ms~! 
to zero occurs within 1mm. Thus the final velocity depends critically 
on what happens at the end of the solenoid and in the fringing field 
that extends beyond.'* In practice, the laser frequency is adjusted so 
that the atoms have sufficient velocity to continue along the apparatus. 
Various methods for extracting the atoms have been developed, such as 
that shown in Fig. 9.3(b) where the field changes abruptly at the end of 
the solenoid so the interaction with the light shuts off cleanly. 


9.2.1 Chirp cooling 


In the other pioneering experiment to laser cool a sodium beam, the 
laser frequency was changed to keep track of the Doppler shift as the 
atoms slowed down. This method has become known as chirp cooling—a 
chirped pulse is one in which the frequency sweeps rapidly. This name 
derives from an analogy with bird-song, where the pitch of the sound 
changes rapidly. We can calculate the sweep time from the number of 
photon kicks required to stop the atom: N = vo/vu,. An atom scatters 
photons at a maximum rate of [1/2 = 1/(2r). Therefore N photons 
are scattered in a time of 2N7r, or at half the maximum deceleration 
it takes twice as long, i.e. 4Nr. For a beam of sodium atoms with 
parameters given in Example 9.1, NV = 34000 and the sweep time is 
4Nr = 2x 107*s. The frequency of the light must be swept over a 
range of more than 1GHz in a few milliseconds. Tuneable dye lasers 
cannot scan this quickly and so the experimenters used electro-optic 
modulators and radio-frequency techniques to change the frequency of 
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the light. Nowadays, chirp cooling of heavy alkalis such as rubidium 
and caesium can be carried out by directly scanning the frequency of 
infra-red semiconductor diode lasers—Fig. 9.4 shows the results of such 
an experiment. It can be seen that the laser cooling sweeps atoms to 
lower velocities to produce a narrow peak in the velocity distribution. It 
is the spread of the velocities within this peak that determines the final 
temperature, not the mean velocity of these atoms. The atoms have a 
much smaller spread of velocities than at room temperature so they are 
cold. 


9.3. The optical molasses technique 


In an atomic beam the collimation selects atoms moving in one direction 
that can be slowed with a single laser beam. Atoms in a gas move in all 
directions and to reduce their temperature requires laser cooling in all 
three directions by the configuration of three orthogonal standing waves 
shown in Fig. 9.5—the light along the Cartesian axes comes from the 
same laser and has the same frequency. At first, you might think that 
this symmetrical arrangement has no effect on an atom since there are 
equal and opposite forces on an atom. However, the radiation forces from 
the laser beams balance each other only for a stationary atom, which is 
what we want to achieve. For a moving atom the Doppler effect leads 
to an imbalance in the forces. Figure 9.5(b) shows the situation for a 
two-level atom in a pair of counter-propagating beams from a laser with 
a frequency below the atomic resonance frequency (red frequency detun- 
ing). Consider what happens in the reference frame of an atom moving 
towards the right, as shown in Fig. 9.5(c). In this frame the Doppler 
effect leads to an increase in the frequency of the laser beam propagat- 
ing in the direction opposite to the atom’s velocity. This Doppler shift 
brings the light closer to resonance with the atom and thereby increases 
the rate of absorption from this beam. This leads to a resultant force 
that slows the atom down.'4 Expressed mathematically, the difference 
between the force to the right and that to the left is 
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Fig. 9.4 The atomic velocity distri- 
bution produced by chirp cooling cae- 
sium atoms with radiation from semi- 
conductor diode lasers. The trace 
shows the experimentally observed flu- 
orescence from the atoms as the laser 
frequency was scanned over a frequency 
range greater than the initial Doppler 
shift of the atoms in the atomic beam. 
From Steane (1991). 


14.4 similar situation arises for move- 
ment in any direction. 
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Fig. 9.5 ‘Optical molasses’ is the name 
given to the laser cooling technique that 
uses the configuration of three orthog- 
onal pairs of counter-propagating laser 
beams along the Cartesian axes shown 
in (a). The laser beams are derived 
from the same laser and have a fre- 
quency w that is slightly below the 
transition frequency between the two 
atomic levels 1 and 2. (b) A stationary 
atom in a pair of counter-propagating 
laser beams experiences no resultant 
force because the scattering is the same 
for each laser beam, but for a moving 
atom, as in (c), the Doppler effect leads 
to more scattering of the light propa- 
gating in the direction opposite to the 
atom’s velocity. (Part (c) is drawn in 
the rest frame of an atom moving at ve- 
locity v.) The imbalance in the forces 
occurs for all directions and damps the 
atomic motion. 
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but typically the second term is about 
w/I ~ 108 larger than the first term. 
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(b) (c) 
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Low velocities, ku < IT, have been assumed. This imbalance in the 
forces arising from the Doppler shift can be written as 


Fmolasses = SON (9.16) 


The light exerts a frictional, or damping, force on the atom just like 
that on a particle in a viscous fluid. This analogy led the Americans 
who first demonstrated the effect (Chu et al. 1985) to call it the optical 
molasses technique (like treacle, or honey)—a name that seems to have 
stuck! Differentiation of eqn 9.4 gives the damping coefficient as!® 
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ae 9.17 
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9.3 


The term I / Isa < 1 has been neglected in the denominator because 
this simple treatment of the optical molasses technique is only valid 
for intensities well below saturation where the force from each beam 
acts independently.‘® Damping requires a positive value of a and hence 
6 = wW— wp < 0, ie. a red frequency detuning (in accordance with the 
physical explanation of the optical molasses technique given above). For 
this condition the plots of the force in Fig. 9.6 have a negative gradient 
OF /Ov <0 at v = 0. 

The above discussion of the optical molasses technique applies to one 
of pair of a counter-propagating laser beams. For the beams parallel to 
the z-axis, Newton’s second law gives 


d {1 dv. 
( Mv?) = Mvz—— = 02zF molasses = —av? E (9.18) 


dt \2 dt 


The components of the velocity along the z- and y-directions obey sim- 
ilar equations, so that in the region where the three orthogonal pairs of 
laser beams intersect the kinetic energy E = 5M(v7 +v? +v?2) decreases: 


2 (9.19) 


Tdamp 


Under optimum conditions the damping time Taamp = M/(2a) is a few 
microseconds (see Exercises 9.7 and 9.8). This gives the time-scale for 
the initial cooling of atoms that enter the laser beams with velocities 
within the capture range of the optical molasses technique, i.e. velocities 
for which the force has a significant value in Fig. 9.6. Equation 9.19 gives 
the physically unrealistic prediction that energy tends to zero because we 
have not taken into account the heating from fluctuations in the force. 
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16 saturation in two counter- 
propagating beams could be taken 
into account by the replacement 
T/TIsat — 21 /TIsat in the denominator of 
the expression for @ (and also Rscatt), 
although if I/Isat is not negligible 
a simple rate equation treatment is 
not accurate. We will see that for 
real atoms, as opposed to theoretical 
two-level atoms, the light field needs 
to be considered as a standing wave 
even for low intensities. 


Fig. 9.6 The force as a function of the 
velocity in the optical molasses tech- 
nique (solid lines) for (a) 6 = —I°/2, 
and (b) 6 = —I. The damping is pro- 
portional to the slope of the force curve 
at v = 0. Note that the force is nega- 
tive for v > 0 and positive for v < 0, 
so the force decelerates atoms. The 
forces produced by each of the laser 
beams separately are shown as dotted 
lines—these curves have a Lorentzian 
line shape and they are drawn with an 
FWHM of I appropriate for low inten- 
sities. For 6 = 0 (not shown in the fig- 
ure) the forces from the two laser beams 
cancel each other for all velocities. The 
velocity capture range is approximately 
gel ji, 
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17 The spontaneously-emitted photons 
that go in the z- and y-directions 
lead to heating in those directions, 
that must be taken into account 
in a full three-dimensional treatment. 
For isotropic spontaneous emission the 
heating would be the same in all direc- 
tions, i.e. v? and vz would increase at 
the same rate as v2. Radiation from an 
electric dipole oscillator is not isotropic, 
but this turns out not to be important 
for reasons discussed below. 


Fig. 9.7 The recoil of an atom from 
each spontaneous emission causes the 
atomic velocity to change by the re- 
coil velocity in a random direction. 
Thus the atom undergoes a random 
walk in velocity space with steps of 
length vy. The equilibrium tempera- 
ture is determined by the balance be- 
tween this diffusive heating and the 
cooling. (For simplicity, only two di- 
rections are shown here.) 


9.3.1 The Doppler cooling limit 


We can write the force from a single laser beam as 


F= Fabs “le OF abs 25 F spont = OF spont : (9.20) 


The average of the force from the absorption of photons is the scattering 
force that we have already derived, Faps = Fscatt, and the random kicks 
from spontaneously-emitted photons average to zero, i.e. Fspont = 0. 
What we have not considered previously is the effect of the fluctuations 
in these two processes, i.e. OF pont and 6Faps- 

The spontaneous emission that always accompanies Fcatt causes the 
atom to recoil in random directions. These recoil kicks lead to a random 
walk of the velocity, as shown in Fig. 9.7 (analogous to the Brownian 
motion of microscopic particles in air). A random walk of N steps gives a 
mean displacement proportional to WN, or equivalently the mean square 
displacement equals NV times the square of the step length. During a 
time t an atom scatters a mean number of photons 


N = Fizentet Q (9.21) 


Spontaneous emission causes the mean square velocity to increase as 


v2 = Recattt x v2, or along the z-axis 


(v2 Nese = nu? Recattt : (9.22) 
Each spontaneous photon gives a recoil kick in the z-direction of hk cos 6 
and the factor 7 = (cos? @) is the angular average, e.g. for isotropic 
spontaneous emission 7 = 1/3.!" 

The fluctuations dF aps arise because the atom does not always absorb 
the same number of photons in a time period t. Each absorption is 
followed by spontaneous emission and the mean number of such events 
in time f¢ is given by eqn 9.21. Assuming that the scattering obeys 
Poissonian statistics, the fluctuations about the mean have a standard 
deviation of VN and cause a random walk of the velocity along the 
laser beam, on top of the change in velocity (acceleration or deceleration) 
caused by the mean force. This one-dimensional random walk caused by 
the fluctuations 6F 4p; leads to an increase in the velocity spread similar 


oS 


to that in eqn 9.22, but without the factor 7 (because all the absorbed 
photons have the same direction): 


( v2 he a Talecattt . (9.23) 


This gives the effect of 6F 4p; for a single laser beam. For an atom in two 
counter-propagating beams the radiation forces of the two beams tend 
to cancel (as in eqn 9.5) but the effect of the fluctuations is cumulative. 
The atom has an equal probability of absorbing photons from either 
beam (incident from the left and right, say) so it receives impulses that 
are randomly either to the left, or to the right, leading to a random walk 
of velocity along the beams.!8 

From eqns 9.22 and 9.23 we can find the heating arising from OR econ 
and 6F aps, respectively. Inserting these terms into eqn 9.18, and assum- 
ing that for a pair of beams the scattering rate is 2Rscatt (twice the rate 
for a single beam of intensity I), we find 


1 dv? = 
5M dt = (1 a n) Er (2Recatt) ie av? ) (9.24) 
where 1 
( 5 Mur (9.25) 


is the recoil energy. Equation 9.24 describes the balance between heat- 
ing and damping for an atom in a pair of counter-propagating beams, 
but in the optical molasses technique there are usually three orthogonal 
pairs of laser beams, as shown in Fig. 9.5. To estimate the heating in 
this configuration of six laser beams we shall assume that, in the region 
where the beams intersect, an atom scatters photons six times faster 
than in a single beam (this neglects any saturation). If the light field is 
symmetrical! then the spontaneous emission is isotropic. Thus averag- 
ing over angles gives n = 1/3, but the overall contribution from dF spont 
is three times greater than for a pair of the laser beams. Therefore the 
factor 1+ 7 in eqn 9.24 becomes 1 + 37 = 2 for the three-dimensional 
configuration.?° This gives the intuitively reasonable result that the ki- 
netic energy increases by twice the recoil energy 2E, in each scattering 
event—this result can be derived directly from consideration of the con- 
servation of energy and momentum in the scattering of photons (see 

Exercise 9.3). 
Setting the time derivative equal to zero in eqn 9.24 gives the mean 
square velocity spread in the six-beam optical molasses configuration as 
ay 2Rscatt 


v2 = 2F, : 
a 


(9.26) 


and similarly along the other laser beam directions. The kinetic energy 
of the motion parallel to the z-axis is related to the temperature by 
i Mv? = 4kpT (according to the equipartition theorem). Substitution 
for a and Recatt gives?! 


Anes = (225/10 ie 
4  —26/T 


ae (9.27) 
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18 An underlying assumption here is 
that the absorption from the laser 
beams is uncorrelated, so that two laser 
beams produce twice as much diffusion 
as a single beam (given in eqn 9.23). 
This is only a reasonable approximation 
at low intensities (I < Isat) where sat- 
uration is not significant, and, as in the 
derivation of eqn 9.17, we shall ignore 
the factor I /TJsat in the denominator 
of Rscatt. A more comprehensive treat- 
ment is given by Cohen-Tannoudji et 
al. (1992). 


19- Theoretically, this might seem diffi- 
cult to achieve since, even if all three 
pairs of beams have the same polariza- 
tion, the resultant electric field depends 
on the relative phase between the pairs 
of beams. In practice, however, these 
phases normally vary randomly in time 
(and with position if the beams are not 
perfectly aligned), so assuming that the 
light field is symmetrical over an aver- 
age of many measurements is not too 
bad. 


20 An alternative justification comes 
from considering the additional contri- 
bution along the z-axis from photons 
spontaneously emitted after absorption 
from the beams along the az- and y- 
axes, This contribution makes up for 
the fraction 1 — n of the spontaneous 
emission that goes in other directions 
following absorption from beams par- 
allel to the z-axis. There is detailed 
balancing between different directions 
because of the symmetry of the config- 
uration. 


li quation 9.17 can be written as 


a= 2hk2 ORscatt 
7 Ow 
—26 


= 2hk* ———._R 
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22 Pinstein pointed out in his funda- 
mental work on radiative absorption 
and emission processes of atoms in a 
thermal radiation field that the momen- 
tum exchange between light and mat- 
ter would bring the atoms into ther- 
mal equilibrium with the surroundings 
(Einstein 1917). When the radiation 
has a spectral distribution correspond- 
ing to 0K (monochromatic light) the 
atom would be expected to approach 
this temperature. 


23Narrow transitions with AT < E;, 
give rise to a different behaviour, that 
has some similarities to the discussion 
of cooling using narrow Raman transi- 
tions in Section 9.8. 


*4-The principal idea of magneto- 
optical trapping was suggested by Jean 
Dalibard and demonstrated at Bell 
Laboratories, USA in collaboration 
with a group from MIT. 


This function of s = —26/T has a minimum at 6 = w — wp = —I'/2 of 
AL 


This key result is the Doppler cooling limit. It gives the lowest temper- 
ature expected in the optical molasses technique. On general grounds, 
we expect a limit of this magnitude for processes in a two-level atom 
since AT = h/r represents the smallest energy scale in the system.?” For 
sodium Tp = 240 wK, which corresponds to a most probable velocity of 
0.5ms~'. This velocity can be written as 


1/2 1/2 
ma (F) =(Bz) =e, (29) 
where vu, ~ I'/k gives an estimate (to within a factor of 2) of the capture 
velocity for the optical molasses technique, i.e. the velocity range over 
which Fycatt has a significant value. For sodium v, = 0.03ms~! and 
Ue = 6ms—!, and the above treatment of the optical molasses technique 
is valid for velocities within this range.?° 

The theory as presented so far was initially thought to describe the 
optical molasses technique until experimental measurements found much 
lower temperatures under certain conditions, in particular when the 
Earth’s magnetic field was cancelled out. The two-level model of an 
atom cannot explain this sub-Doppler cooling. Real alkali atoms have 
degenerate energy levels (|I.JF My) states). Remarkably, this does not 
just complicate the situation but actually allows new cooling mecha- 
nisms to occur, as described in Section 9.7. This is a rare example in 
which things turned out to be much better than expected. The fact 
that Doppler cooling theory does not accurately describe the optical 
molasses experiments with alkali metal atoms gives an excuse for the 
rather cavalier treatment of saturation in this section. 


9.4 The magneto-optical trap 


In the optical molasses technique cold atoms accumulate in the region 
where the three orthogonal pairs of laser beams intersect because it 
takes a considerable time for atoms to diffuse out, e.g. several seconds 
for beams of 1cm radius. With the correct choice of polarizations for 
the laser beams, this configuration can be turned into a trap by the 
addition of a magnetic field gradient, as illustrated in Figs 9.8 and 9.9; 
the two coils with currents in opposite directions produce a quadrupole 
magnetic field. This magnetic field is much weaker than in the purely 
magnetic traps described in Chapter 10 and does not confine atoms by 
itself. In the magneto-optical trap (MOT) the quadrupole magnetic field 
causes an imbalance in the scattering forces of the laser beams and it 
is the radiation force that strongly confines the atoms.?* The principle 
of the MOT is illustrated in Fig. 9.9(a) for a simple J = 0 to J = 1 
transition. At the point in the middle of the coils the magnetic fields 


produced by the coils cancel out, so that B = 0. Close to this zero of the 
field there is a uniform field gradient that perturbs the atomic energy 
levels; the Zeeman effect causes the energy of the three sub-levels (with 
M,; = 0, +1) of the J = 1 level to vary linearly with the atom’s posi- 
tion, as shown for the z-axis in Fig. 9.9(a).2° The counter-propagating 
laser beams have circular polarization as shown in Fig. 9.9(b) and a fre- 
quency slightly less than the atomic resonance frequency. The Zeeman 
shift causes an imbalance in the radiation force in the following way. 
Consider an atom displaced from the centre of the trap along the z-axis 
with z > 0, so the AM, = —1 transition moves closer to resonance with 
the laser frequency—the laser has a frequency below the atomic reso- 
nance in zero field to give damping by the optical molasses mechanism.”° 
The selection rules lead to absorption of photons from the beam that 
excites the o~ transition and this gives a scattering force that pushes 
the atom back towards the trap centre. A similar process occurs for a 
displacement in the opposite direction (z < 0); in this case the Zeeman 
shift of the transition frequency and selection rules favour absorption 
from the beam propagating in the positive z-direction that pushes the 
atom back towards z = 0. Note that these beam polarizations and the 
quantisation axis of the atom have been defined relative to a fixed direc- 
tion in space, i.e. the z-direction in this one-dimensional example. For 
z > 0 this is the same as the direction of the magnetic field, but for z < 0 
the magnetic field points the opposite way; hence the Mj; = —1 state 
lies above +1 in this region, as shown in Fig. 9.9(a). Strictly speaking, 
ot and o” refer to transitions of the atom and labelling the radiation 
as ot is shorthand for circularly-polarized radiation of the handedness 
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Fig. 9.8 A pair of coils with cur- 
rents in opposite directions produces a 
quadrupole magnetic field. The field is 
zero at the centre of the coils and its 
magnitude increases linearly in every 
direction for small displacements from 
the zero point. 


25-The energy levels also vary in the 
other directions. ‘The Maxwell equation 
div B = 0 implies that 

dB, dB,  1dB, 

dx dy 2 dz’ 
so the gradient in any radial direction 
is half of that along the z-direction. 


26 The MOT requires three orthogonal 
pairs of o+-o~ beams, but the opti- 
cal molasses technique works with other 
polarization states, e.g. the Sisyphus 
cooling in Section 9.7 uses linearly- 
polarized beams. 
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Fig. 9.9 (a) The mechanism of a 
magneto-optical trap illustrated for the 
case of an atom with a J = 0 to 
J = 1 transition. In the magnetic field 
gradient the Zeeman splitting of the 
sub-levels depends on the atom’s posi- 
tion. Two counter-propagating beams 
of circularly-polarized light illuminate 
the atom and the selection rules for 
transitions between the Zeeman states 
lead to an imbalance in the radiative 
force from the laser beams that pushes 
the atom back towards the centre of 
the trap. (Not to scale; the Zeeman 
energy is much smaller than the opti- 
cal transition energy.) (b) A magneto- 
optical trap is formed from three or- 
thogonal pairs of laser beams, as in the 
optical molasses technique, that have 
the requisite circular polarization states 
and intersect at the centre of a pair 
of coils with opposite currents. The 
small arrows indicate the direction of 
the quadrupole magnetic field produced 
by the coils (as shown in more detail in 
Fig 9.8). 


(b) 


that excites the o* transition (and similarly for o~).?” To describe the 
magneto-optical trap mathematically we can incorporate the frequency 
shift caused by the Zeeman effect into eqn 9.15 (that describes the optical 
molasses technique):78 


Fuor = ine (w ~kuv—- (wo =F Bz) ) Ds att (w + ky — (wo = Bz)) 
OF OF 


The term wo+z is the resonant absorption frequency for the AM, = +1 
transition at position z, and wo — Gz is that for AM), = —1. The Zeeman 
shift at displacement z is 


(9.31) 


where g = g, in this case.”® The force depends on the frequency detuning 
6 = Ww — Wo, 80 OF /Owp = —OF /Ow and hence 


OF 
Fuot = am (kv + Bz) 
= —av - sic (9.32) 


The imbalance in the radiation force caused by the Zeeman effect leads 
to a restoring force with spring constant af/k (which is written in this 
form to emphasise that it arises in a similar way to damping). Under 
typical operating conditions the atom undergoes over-damped simple 
harmonic motion, as shown in Exercise 9.9. Atoms that enter the region 
of intersection of the laser beams are slowed (as in the optical molasses 
technique) and the position-dependent force pushes the cold atoms to the 
trap centre. This combination of strong damping and trapping makes 
the magneto-optical trap easy to load and it is very widely used in laser 
cooling experiments. 

A typical apparatus uses an MOT to collect cold atoms from a slowed 
atomic beam. When sufficient atoms have accumulated the magnetic 
field of the MOT is turned off to cool the atoms by the optical molasses 
technique before further experiments are carried out.2° This procedure 
gives more atoms (at a higher density) than the optical molasses tech- 
nique on its own because the MOT captures faster atoms than optical 
molasses. The magnetic field in the MOT changes the atom’s absorp- 
tion frequency in a similar way to the Zeeman slowing technique, e.g. if 
the magneto-optical trap has laser beams of radius 5mm and we take 
this as the stopping distance in eqn 9.8 then the trap captures sodium 
atoms with velocities less than v.(MOT) ~ 70ms~'. But atoms enter 
the MOT from all directions and the magnetic field varies linearly with 
position (a constant gradient), so the situation is not the same as the op- 
timum case of an atom moving along the axis of a tapered solenoid with 
the counter-propagating laser beam (see Exercise 9.10). Nevertheless, 
an MOT captures atoms with much faster velocities than the optical 
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27 This is a convenient convention for 
discussing the principles of laser cool- 
ing where the transitions that occur 
depend on the sense of rotation of 
the electric field around the quantiza- 
tion axis of the atom (whereas hand- 
edness depends on both the sense of 
the rotation and the direction of the 
propagation). The electric field of the 
circularly-polarized radiation drives the 
bound atomic electron(s) around in the 
same sense as the electric field; there- 
fore, radiation labelled o+ that imparts 
positive angular momentum about the 
quantization axis, AM; = +1, has 
an electric field that rotates clockwise 
when viewed along that quantization 
axis (see Fig. 9.9(b)), i.e. in the same 
direction as a particle with (L,) > 0. 
The magneto-optical trap also confines 
atoms along the z- and y-axes, and 
all other directions. In practice, these 
traps are extremely robust and only re- 
quire the polarizations of the beams to 
be approximately correct—some atoms 
are trapped so long as none of the 
beams have the wrong handedness. 


28This assumes a small Zeeman shift 
Bz <T in addition to the small velocity 
approximation ku <T. 


29 More generally, g = gp) Mpi—grpMp 
for a transition between the hyperfine- 
structure levels |F, Mr) and |F’, Mp); 
however, g ~ 1 for many of the transi- 
tions used for laser cooling—see Exam- 
ple 9.1. 


30 atoms in the MOT have a higher 
temperature than in the optical mo- 
lasses technique for several reasons: the 
sub-Doppler cooling mechanisms break 
down when the Zeeman shift exceeds 
the light shift and there is strong ab- 
sorption of the laser beams as they pass 
through dense clouds of cold atoms (see 
Exercise 9.12). 
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31 These elements have an appreciable 
vapour pressure at room temperature. 
The MOT captures the slowest. atoms 
in the Maxwellian velocity distribu- 
tion. The equilibrium number of atoms 
trapped directly into an MOT from a 
vapour is proportional to the fourth 
power of vc(MOT), so the method is 
very sensitive to this parameter. 


32 Broadening from collisions and the 
Doppler effect is negligible and broad- 
ening caused by the inhomogeneous 
magnetic field is small, e.g. for a typical 
field gradient of 0.1Tm~! and a cloud 
of radius 3mm the variation in the Zee- 
man shift is ~ 4 MHz (for g = 1). 


molasses technique, e.g. for sodium ve(MOT) > v-(molasses) ~ 6ms~* 


(eqn 9.29). This relatively large capture velocity makes it possible to 
load an MOT directly from a room temperature vapour, and this method 
can be used instead of slowing an atomic beam for the heavy alkalis ru- 
bidium and caesium (see Exercise 9.11).°! Typically, an MOT loaded 
from a slow atomic beam contains up to 10° atoms. Experiments that 
capture atoms directly from a vapour usually have considerably less. 
Such general statements should be treated with caution, however, since 
there are various factors that limit number and density in different op- 
erating regimes, e.g. absorption of the laser light—the cold atoms that 
congregate in the centre of the MOT have close-to-the-maximum optical 
absorption cross-section?*—absorption leads to a difference or an imbal- 
ance in the intensities of the laser beams propagating through the cloud 
of cold atoms that affects the trapping and cooling mechanisms. 

At equilibrium each atom absorbs and emits the same amount of light. 
Therefore a large cloud of cold atoms in an MOT scatters a significant 
fraction of the incident light and the atoms can be seen with the naked 
eye as a bright glowing ball in the case of sodium; for rubidium the 
scattered infra-red radiaticn can easily be detected on a CCD camera. 
The MOT provides a source of cold atoms for a variety of experiments, 
e.g. loading the dipole-force traps (as described in the following sections) 
and magnetic traps (Chapter 10). Finally, it is worth highlighting the 
difference between magneto-optical and magnetic trapping. The force in 
the MOT comes from the radiation—the atoms experience a force close 
to the maximum value of the scattering force at large displacements from 
the centre. The magnetic field gradients in a magneto-optical trap (that 
tune the absorption frequency of the atoms) are much smaller than those 
used in magnetic traps. A typical MOT has a gradient of 0.1T m7! and 
when the light is switched off this produces a magnetic force that is not 
sufficient to support atoms against gravity. 


9.5 Introduction to the dipole force 


The scattering force equals the rate at which an object gains momentum 
as it absorbs radiation. Another type of radiation force arises from the 
refraction of light as illustrated in Fig. 9.10. A simple prism that deflects 
light through an angle @ feels a force 


r=(2)(6) 


where the quantity [A/c corresponds to the rate at which radiation with 
intensity / carries momentum through a cross-sectional area A (perpen- 
dicular to the direction of propagation); this quantity corresponds to the 
total force when the radiation is absorbed (eqn 9.1). When the beam is 
refracted the difference between the incoming and outgoing momentum 
flow leads to the factor 2sin(@/2) by simple geometry. The angle and 
the resultant force increase with the refractive index. 


(9.33) 
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These simple considerations show that the forces associated with ab- 
sorption and refraction by an object have similar magnitude but they 
have different characteristics; this can be seen by considering a small 
dielectric sphere that acts as a converging lens with a short focal length, 
as shown in Fig. 9.11. In a laser beam of non-uniform intensity, the 
difference in the intensity of the light refracted on opposite sides of the 
sphere leads to a resultant force that depends on the gradient of the 
intensity: a sphere with a refractive index greater than the surrounding 
medium, sphere > Nmedium, feels a force in the direction of increasing 
intensity, whereas a sphere with nsphere < Mmedium 18 pushed away from 
the region of high intensity.°? Thus the sign of this gradient force (also 
known as the dipole force) depends on Msphere- The refractive index 
of materials varies with frequency in the characteristic way shown in 
Fig. 9.12. This behaviour can be understood in terms of a simple classi- 
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Fig. 9.10 Radiation that is deflected 
by a glass prism (or a mirror) exerts a 
force on that object equal and opposite 
to the rate of change of momentum of 
the radiation. 


Fig. 9.11 The refraction of light by a 
dielectric sphere with a refractive in- 
dex greater than that of the surround- 
ing medium. In a laser beam with a 
Gaussian profile the intensity along ray 
A is greater than for ray B. This leads 
to a resultant force on the sphere to- 
wards the region of high intensity (cen- 
tre of the laser beam), in addition to 
an axial force pushing in the direction 
of the beam. Analogous forces arise on 
smaller particles such as atoms. After 
Ashkin (1997). 


33-'The calculation of this force using ge- 
ometrical optics is straightforward in 
principle, but integration over all the 
different rays and inclusion of reflection 
coefficients makes it complicated. 
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Fig. 9.12 Absorption has a Lorentzian 
line shape with a peak at the resonance 
frequency wo. The refractive index is 
zero On resonance, where it changes 
sign, and this characteristic dependence 
on frequency leads to dispersion. 


Fig. 9.13 A tightly-focused beam of 
light exerts a radiation force on a di- 
electric sphere that pulls it towards the 
region of high intensity (at the focus). 
Not all of the light is transmitted at the 
interfaces and the reflected rays are in- 
dicated. After Ashkin (1997). 


34 Generally speaking, the effects of re- 
fraction are most apparent when they 
are not obscured by absorption, i.e. 
away from a resonance. A similar situ- 
ation arises for forces on the individual 
atoms. 


Refractive = 
index i \ Absorption 
\ i 


Dielectric 
sphere 


Refracted light 


cal model in which bound electrons execute damped harmonic oscillation 
with resonance frequency wo, see Section 7.5.1 and Fox (2001), but the 
relationship between the refractive index and absorption at the resonant 
frequency is very general (independent of any particular model). Dis- 
persion and absorption are different facets of the same interaction of 
light with matter; strong absorption leads to large changes in refractive 
index. The variation in the refractive index extends over a larger fre- 
quency range than the absorption, e.g. although air and glass (of good 
optical quality) are both transparent at visible wavelengths they have 
refractive indices of 1.0003 and 1.5, respectively, associated with strong 
absorption in the ultraviolet region; Ngiass — 1 >> Nair — 1 because a solid 
has a higher density of atoms than a gas.*4 

The force that attracts an object towards a region of high intensity 
has been used to manipulate microscopic objects in a technique called 
optical tweezers that was developed by Arthur Ashkin (Ashkin et al. 
1986). The objective lens in an optical microscope is used to focus a 
laser beam tightly so that there is a strong gradient force along the axis, 
as shown in Fig. 9.13, in addition to trapping in the radial direction 


shown in Fig. 9.11. As the laser beam is moved the particles remain 
trapped in the region of high intensity. The microscope is used to view 
the object through a filter that blocks the laser light. The objects are 
immersed in water and mounted on a microscope slide in a standard way. 
The liquid provides viscous damping of the motion.*® Optical tweezers 
works, not just for simple spheres, but also for biological cells such as 
bacteria, and these living objects can withstand the focused intensity 
required to trap them without harm (the surrounding water prevents 
the cells from heating up). For example, experiments have been carried 
out where a bacterium is tethered to the surface of a glass microscope 
slide by its flagellum (or ‘tail’) and the body is moved around by optical 
tweezers. This gives a quantitative measure of the force produced by the 
microscopic biological motor that moves the flagellum to propel these 
organisms (see Ashkin (1997) and Lang and Bloch (2003) for reviews). 

This section has introduced the concept of a radiation force called the 
gradient force, or dipole force, and the next section shows that a similar 
force occurs for atoms.°6 


9.6 Theory of the dipole force 


Actually, this section does not just derive the dipole force on an atom 
from first principles but also the scattering force, and so demonstrates 
the relationship between these two types of radiation force. An electric 
field E induces a dipole moment of —er = €9x,E on an atom with a 
(scalar) polarizability €9x.. The interaction energy of this dipole with 
the electric field is given by 

1 Pays! 

U = —=e9x,k* = ~er-E, (9.34) 

2 2 
where E£ is the amplitude of the electric field and U is used here to denote 
energy to avoid confusion with the electric field. This expression comes 
from the integration of dU = —eox,€ dé from € = 0 to € = E (the factor 
of 1/2 does not occur for a permanent electric dipole). Differentiation 
gives the z-component of the force as 


OU OE 
Bf, a 7 pe = €0Xal A 


(9.35) 
and similarly for F;, and Fy. Radiation of angular frequency w, prop- 
agating along the z-direction, can be modelled as an electric field E = 
Ep cos (wt — kz) €,.°" The gradient of the energy gives the z-component 
of the force as*® 


F, = —ex = cos (wt — kz) + kEp sin (wt — kz) ; (9.36) 


if Oz 
The two parts of this force can be understood using either the classical 
or the quantum mechanical expressions for the dipole that were derived 
in Chapter 7. The classical model of the atom as an electron undergoing 
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357 aser radiation can levitate small ob- 
jects in air, but this is less straightfor- 
ward than the manipulation of objects 
floating in water. 


36 This analogy is not just of pedagog- 
ical interest—the first experiments on 
optical tweezers and the dipole-force 
trapping of atoms were carried out in 
the same place (Bell Laboratories in the 
USA). 


37 This particular field is linearly polar- 
ized parallel to €y, as in Section 7.3.2, 
but the results derived here are quite 
general. 


38-This classical treatment gives the 
same result as the quantum mechan- 
ical derivation when the electric field 
varies slowly over the typical dimen- 
sions of an atomic wavepacket (Agp < 
Alight). Under these circumstances, 
classical equations of motion corre- 
spond to equations for the expectation 
values of the quantum operators, e.g. 
the rate of change of momentum equals 
the force corresponding to 


d(p) 

dt 
This is an example of Ehrenfest’s theo- 
rem in quantum mechanics. The quan- 
tum mechanical derivation of the dipole 


force is given in Cohen-Tannoudji ef al. 
(1992). 


Saye 
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39 4s shown in Section 7.5.1, the classi- 
cal model does not account for satura- 
tion. 


40 = co Quel: 
Using sin“ = cos* = 5. 


4lin optics, it is generally the effect 
of the medium on the light that is of 
interest, e.g. the angle through which 
the medium refracts, or bends, a light 
beam, but this implies that the medium 
feels a force equal to the rate of change 
of the momentum of the light. The 
refractive index, and absorption coeffi- 
cient, describe bulk properties, whereas 
it is the effect of light on individual 
atoms that is of interest here. 


simple harmonic motion gives very useful insight into the frequency de- 
pendence of the force, but the quantum treatment is required to find the 
correct intensity dependence.®? This section presents both approaches, 
starting with the classical one—because of the very close parallels be- 
tween them this requires little extra effort. 

Classically, the displacement of the electron x by an electric field is 
calculated by modelling the atom as a harmonic oscillator with a driving 
term. Expressing x in terms of its components in phase (/) and in 
quadrature (V) to the applied field, we find 


F, = —e {U cos (wt — kz) — Vsin (wt — kz)} 


9.37 
x { 72 cos (ut — ke) + asin (wt — ke) b. eg 
Zi 
The time average over many oscillation periods gives*® 
tal AS OEo 
{ae 5 { De ViE } 
oe —(w—wo)Eo _ OEo (B/2)kEG } 
~ Amu | (w — wo)? + (8/2)? Oz — (w— wo)? + (8/2)? J’ 
(9.38) 


using eqns 7.58 and 7.59 for U and V. The intensity of the light is 
I = Se9cEG and, by a simple extension of the derivation given above 
to the z- and y-directions, the radiation force can be written in vector 
notation as 


—(w aa Wo) VI 


ae {aeShee Pie 
2eome | (w— wo)? + (8/2)? w 


Ik } 

(w — wo)? + (8/2)? ¢ |k| J 

(9.39 
The in-phase component of the dipole (/) leads to a force proportional 
to the gradient of the intensity. The frequency dependence of this com- 
ponent follows a dispersive line shape that is closely related to the re- 
fractive index,*! as shown in Fig. 9.12. (The dependence on 1/w has 
a negligible effect on narrow transitions 0 < wo.) At the atomic reso- 
nance frequency w = Wo the component U = 0. The quadrature term, 
from Y, has a Lorentzian line shape and this force, arising from ab- 
sorption, is proportional to J and points along the wavevector of the 
radiation k. This classical model gives a simple way of understanding 
various important features of the forces on atoms and shows how they 
relate to radiation forces on larger objects (such as those discussed in 
the introductory Sections 9.1 and 9.5); however, we shall not use it for 
quantitative calculations. 

To find the force quantum mechanically, we use eqn 7.36 for the dipole 
moment in terms of the components of the Bloch vector u and v. Sub- 
stitution into eqn 9.36, and taking the time average as above, gives (cf. 
eqn 9.38) 


F= 


; 5 a (9.40) 


= Fgipole = Fscatt : 


Tis ~eX12 ju - vE ak} 


(9.41) 


The force that depends on the in-phase component of the dipole wu is 
the dipole force and the other part is the scattering force. Using the 
expressions for u and v given in eqn 7.68 and the Rabi frequency Q = 
eX 12Fo/h, we find that 


r 12/2 
fe L hk oO .Oml/art?pr/,? be 
: 262+ 92/24+T?/4 cee 
which is consistent with eqn 9.4, and 
ho 2 on. 
i eole = (9.43) 


2 62 +.02/2412/4 dz’ 


where 6 = w — Wo is the frequency detuning from resonance. The ex- 
pression for the scattering force has been repeated here for ease of com- 
parison with eqn 9.43. These forces have essentially the same frequency 
dependence as in the classical model, with a line width that is power 
broadened so that 6 <—> [ (1+ 2?/I?). The dipole force is zero on 
resonance (Fyjpole = 0 for 6 = 0), and for |6| >> T’ (and an intensity such 
that |6| >> Q) the dipole force equals the derivative of the light shift 


(eqn 7.93): . 
0 fAQ 
Fuipole X —~ | —— })- : 

ee = (FF) C=) 
Thus the light shift, or a.c. Stark shift, for an atom in the ground state 
acts as a potential Ugipole in which the atom moves. More generally, in 
three dimensions 
0 O 


e rae! 
Oz =f "By + @; =} Uaipole => —VU dipole ’ (9.45) 


Faipole = — {@ 5 ae 


where 
(0 ame 1) Gf 


Udipole ~ Gy = coe. 
When 6 is positive (w > wo) this potential has a maximum where the 
intensity is highest—the atom is repelled from regions of high intensity. 
In the opposite case of frequency detuning to the red (6 negative) the 
dipole force acts in the direction of increasing J, and Udipole is an at- 
tractive potential—atoms in a tightly-focused laser beam are attracted 
towards the region of high intensity, both in the radial direction and 
along the axis of the beam. This dipole force confines atoms at the fo- 
cus of a laser beam in an analogous way to optical tweezers to create 
a dipole-force trap.42 Normally, dipole traps operate at large frequency 
detuning (|6| >> I), where to a good approximation eqn 9.3 becomes 


(9.46) 


ee as 
catt — 8 §2 pn = 


Ri; (9.47) 
This scattering rate depends on J/6*, whereas the trap depth is pro- 
portional to J/6 (in eqn 9.46). Thus working at a sufficiently large fre- 
quency detuning reduces the scattering whilst maintaining a reasonable 
trap depth (for a high intensity at the focus of the laser beam). Usually 
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42-The situation for an atom with de- 
tuning 6 < 0 resembles that of a di- 
electric sphere with a refractive index 
greater than the surrounding medium. 


200 Laser cooling and trapping 


43\Whereas for scattering-force tech- 
niques the frequency of the laser must 
be tunable, so that it can be adjusted 
to within several line widths from an 
atomic transition frequency. 


44 This frequency detuning 6 ~ wo, so 
that the rotating-wave approximation 
is not very good. 


451 atoms gain twice the recoil energy 
per scattering event, as in eqn 9.24, it 
takes many seconds before the atoms 
boil out of the trap. The fluctuations 
in the dipole force itself can cause heat- 
ing and Fgipole + OF dipole Should be in- 
cluded in eqn 9.20. The fluctuations 
dF dipole give comparatively small heat- 
ing in a dipole-force trap with a large 
frequency detuning; however, there are 
circumstances where 6F dipole is the 
dominant cause of heating, e.g. for the 
Sisyphus effect described in Section 9.7. 


46 Writing eqn 9.50 as 
Rvcaee = om e (2)7 


where I’/2 is the maximum of Rscatt, 
shows that Fecatt is 5 x 107% times its 
maximum value. 


there are two important criteria in the design of dipole-force traps: (a) 
the trap must be deep enough to confine the atoms at a certain temper- 
ature (that depends on the method of cooling); and (b) the scattering 
rate must be low to reduce heating. 


Example 9.2 Dipole trapping of sodium atoms 

The wavelength of the laser light used for a dipole-force trap depends 
mainly on practical considerations.** It is convenient to use a high- 
power solid-state laser such as a Neodymium:YAG laser that produces 
continuous-wave radiation at a fixed infra-red wavelength of A = 1.06 ym. 
The frequency detuning of this laser radiation from the sodium resonance 
at Ao = 589 nm is 


(9.48) 


in units of [, where 1/[T = r = 16ns.** Solid-state lasers can produce 
powers of many tens of watts, but in this example we use a conservative 
value of P = 1W. When fecused to a waist of wo = 10 ym this laser 
beam has an intensity of I = 2P/(rw2) = 6.4x 109 Wm~? = 1x 108 gat. 
Equation 9.46 gives 

Uaipole = > 5 Al == A. pK , (9.49) 


Thus atoms cooled below the Doppler cooling limit AI'/2 can be trapped. 
For this laser intensity and frequency detuning, eqn 9.47 gives 


Ree 24 <0 ee (9.50) 
A sodium atom only scatters a few photons per second which gives a low 
heating rate.*° The scattering force is negligible for these conditions,*® 
i.e. the force pushing in the direction of the light is weaker than the 
dipole force pulling the atom towards the high-intensity focus. The 
condition that the laser light has a frequency detuning far from the 
atomic resonance is not restrictive, and calculations along the same lines 
as that given here for sodium show that a laser with the above properties 
can be used for the dipole-force trapping of any alkali metal atom. 


A force derived from a potential is conservative, i.e. the total energy 
remains constant during motion. Thus an atom that enters a dipole trap 
gains kinetic energy as it moves towards the bottom of the potential well 
and then it rides up the other side of the trap and escapes, because no 
energy is lost. To load a dipole trap there must be either some dissipa- 
tion of energy by spontaneous emission (as in the MOT), or the atoms 
must be placed gently in the bottom of the trap. In the first exper- 
imental demonstration of a dipole trap for atoms the laser beam was 
focused into a cloud of atoms that were cooled by the optical molasses 
technique, see Fig. 9.14 (Chu et al. 1986). The trapped atoms were 
observed as a bright spot in the region of more diffuse fluorescence from 


(a) o~ (b) 
|2) 


Atoms in 


optical molasses 


the region of optical molasses because the density of trapped atoms was 
greater. When the focused laser beam was first switched on the dipole 
trap contained relatively few atoms within its small volume, at a density 
comparable with that in the surrounding region, e.g. 10'°cm~*. This 
was perceived to be a problem, but atoms that started off outside the 
trap executed a random walk*’ that took some of them into the dipole 
trap, where they remained. In this way atoms accumulated in the trap 
to give a high density. 

A dipole-force trap formed by a single laser beam gives tight radial 
confinement, but it is weak in the axial direction. Therefore the atoms 
in such a trap form an elongated, cigar-shaped cloud. To obtain strong 
confinement in all directions, if necessary, one can form a dipole-force 
trap at the intersection of two laser beams.*® Many other configurations 
are possible and the design of dipole traps is restricted only by the 
form of the intensity distributions that can be sculpted from laser light. 
Diffraction limits the minimum distance over which the intensity of the 
light changes. An ingenious way of creating a high-intensity gradient is 
shown in Fig. 9.15. A laser beam that is totally internally reflected at 
the surface of glass gives an evanescent wave in which the electric field 
falls off exponentially over a distance of the wavelength of the light.*® 
For a laser frequency to the blue (6 > 0), the repulsive dipole force near 
the surface acts like a reflective coating for atoms. This creates a mirror 
that reflects low-energy atoms, as shown in Fig. 9.15. 


96.1 


The dipole force is strong in a standing wave of light because the inten- 
sity changes from a maximum (at the anti-nodes) to zero (at the nodes) 
over a distance of A/2 to give a high gradient of intensity. The physical 
explanation for this strong force is stimulated scattering of radiation. 
In a standing wave, an atom absorbs light with wavevector k from one 
beam and the laser beam in the opposite direction stimulates emission 
with wavevector k’ = —k; this gives the atom an impulse of 2k. The 
rate of this stimulated process does not saturate at high intensities.°° 


Optical lattice 
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Fig. 9.14 (a) An intense laser beam 
alters the energy levels of an atom, as 
illustrated for a radial direction across 
a laser beam propagating perpendicu- 
lar to the plane of the figure. For a 
laser frequency less than the atomic res- 
onance frequency the a.c. Stark effect 
forms a potential well in the ground- 
state energy and atoms are attracted 
towards regions of high intensity. (b) 
The dipole-force trap formed by a fo- 
cused laser beam can be loaded with 
cold atoms produced by the optical mo- 
lasses technique, as described in the 
text. 


47.4 random walk in space leading to 
spatial diffusion, rather than the ran- 
dom walk of the velocity leading to 
heating that was used to calculate the 
Doppler cooling limit—both processes 
are caused by scattering. 


= oanhe dipole potential is proportional 
to the total intensity. Laser beams 
with orthogonal polarizations, or sub- 
stantially different frequencies, do not 
interfere and the total intensity is the 
sum of the individual intensities. 


49- This behaviour of the light closely re- 
sembles the quantum reflection at a po- 
tential step that is higher than the en- 
ergy of the incident particle. The wave- 
function falls exponentially to zero in 
the classically forbidden region. 


50 More generally, the dipole force 
arises from a stimulated process of 
absorption of a photon of wavevec- 
tor k; and stimulated emission with 
wavevector kg. In this process the 
atom receives an impulse i(k; — ko) 
that changes the atom’s momentum. 
A-tightly-focused laser beam contains 
a range of wavevectors and exerts a 
dipole force on an atom analogous to 
that in the optical tweezers technique. 
A dipole force cannot occur in a plane 
wave since the stimulated processes 
have ki = ko. 
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Fig. 9.15 The evanescent wave, cre- 
ated when a laser beam is totally in- 
ternally reflected, forms a mirror for 
atoms. For a light with a blue detun- 
ing (w > wo) the dipole force repels 
atoms from the region of high inten- 
sity close to the vacuum~glass interface. 
(The same principle applies if the sur- 
face is curved (e.g. concave) and such 
an arrangement can be used to focus 
the matter waves.) 


51 This form is only true for large de- 
tunings, 6 >T,. If this inequality is 
satisfied then there is also little sponta- 
neous scattering from the atoms. 


52 Interference will generally lead to 
a periodic arrangement of positions 
where atoms become localised (at in- 
tensity maxima for a frequency detun- 
ing to the red) in a three-dimensional 
standing wave of light, but the creation 
of a particular configuration of the op- 
tical lattice requires control of the po- 
larization (and relative phase). 


clays phase-space density at which 
BEC occurs is approximately equal to 
that at which there is one atom per well 
in the ground state of an optical lattice. 


5445 atoms pass through a standing 
wave of light they accumulate a phase 
shift of order ¢ ~ Uot/h for an inter- 
action time t. Light with either sign of 
frequency detuning can be used to give 
g@ ~ +n, and so create a phase grating 
with significant amplitude in the dif- 
fraction orders. 
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The dipole potential associated with this force depends on the in- 
tensity of the light (cf. eqn 9.46 for a large frequency detuning). Two 
counter-propagating beams of linearly-polarized light produce an electric 
field given by 


E = Eo {cos (wt — kz) + cos (wt + kz) }e, 


= 2p cos (kz) cos (wt) €; . (9.51) 
This standing wave gives a dipole potential of the form®! 


Here Up is the light shift at the anti-nodes—these maxima have an inten- 
sity four times that of the individual beams. For a frequency detuning 
to the red, a standing wave of light traps atoms at the anti-nodes and 
gives confinement in the radial direction as in a single beam. This reg- 
ular array of microscopic dipole traps is called an optical lattice. With 
more laser beams the interference between them can create a regular 
array of potential wells in three dimensions, e.g. the same configuration 
of six beams in the optical molasses technique shown in Fig. 9.5 (along 
+e,, te, and te,) can create a regular cubic lattice of potential wells 
for suitable polarizations and a large frequency detuning.°? The poten- 
tial wells in this optical lattice have a spacing of 4/2, and so one atom 
per lattice site corresponds to a density of 8/® ~ 7 x 10'2cm~? for 
A = 1.06um. Therefore the sites will be sparsely populated when the 
atoms are loaded into the lattice after cooling by the optical molasses 
technique. (A typical number density in the optical molasses technique 
is a few times 101° cm~* = 0.01 atoms pym~?.) 

Experiments that load more than one atom in each potential well have 
been carried out by adiabatically turning on the light that creates an 
optical lattice in a region containing a sample of atoms that are in a 
Bose-Einstein condensate (see Chapter 10).°* Moreover, these atoms go 
into the lowest vibrational level in each of the potential wells. The use of 
one-dimensional standing waves as diffraction gratings for matter waves 
is discussed in Chapter 11.°4 


ON 


9.7 The Sisyphus cooling technique 


9.7.1 General remarks 


The dipole force experienced by atoms in a light field can be stronger 
than the maximum scattering force because Fyipole does not saturate 
with increasing intensities (whereas Fycatt does), but stimulated pro- 
cesses alone cannot cool atoms. To dissipate energy there must be some 
spontaneous emission to carry away energy from the atoms—this is true 
for all cooling mechanisms, e.g. Doppler cooling by the scattering force, 
and it is particularly apparent for the process described in this section. 

The first experimental evidence that Doppler cooling does not give a 
complete description of the laser cooling in a standing wave came from 
measurements of the velocity distribution of atoms by the direct time-of- 
flight method shown in Fig. 9.16. When William Phillips and co-workers 
carried out such measurements they were pleasantly surprised to find 
that the optical molasses technique can cool atoms below the Doppler 
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Fig. 9.16 (a) The temperature of a sample of atoms that has been cooled by the optical- molasses technique is measured by 
turning off the six laser beams (not shown) so that the cloud of cold atoms falls downwards to the bottom of the vacuum 
chamber (because of gravity). The expansion of the falling cloud depends on the initial spread of the velocities. To observe 
this expansion a horizontal probe laser beam is aligned several centimetres below the initial position of the cloud. This probe 
beam has a frequency close to the atomic resonance frequency w ~ wo so that atoms scatter light as they pass through and this 
fluorescence signal is recorded (or absorption could be monitored). (b) Instead of just dropping the atoms, they can be launched 
upwards to form an atomic fountain. This configuration is used for precision measurements, as described in Section 9.9. 
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Fig. 9.17 The laser cooling mechanism 
in a standing wave with a spatially- 
varying polarization. The energy levels 
of the atom are perturbed by the light 
in a periodic way, so that the atoms 
travel up and down hills and valleys 
(maxima and minima) in the potential 
energy. Kinetic energy is lost when the 
atom absorbs laser light at the top of 
a hill and emits a spontaneous photon 
of higher frequency, so that it ends up 
in a valley. This has been called the 
Sisyphus effect and can be made more 
probable than the reverse process, so 
that there is strong laser cooling. Thus 
atoms in a standing wave are cooled be- 
low the Doppler cooling limit (the low- 
est temperature achievable with scat- 
tering force alone). 


cooling limit in eqn 9.28. This cannot be understood in terms of a simple 
picture in which the scattering forces from each of the six laser beams 
add independently. Sodium, and other alkalis, have Zeeman structure 
in their ground states and this additional complexity, as compared to a 
simple two-level atom, allows new processes to occur. 

A particularly important mechanism by which atoms dissipate energy 
as they move through a standing wave is the Sisyphus effect that was ex- 


plained by Jean Dalibard and Claude Cohen-Tannoudji (1989), and this 
section follows the description given in that seminal paper. Steven Chu 
and co-workers also developed a model to explain sub-Doppler cooling 
based on the transfer of population between the different sub-levels of 
the ground configuration (optical pumping) as atoms move through the 
light field. This transfer of populations takes place on a time-scale Tpump 
that can be much longer than the spontaneous lifetime (Tpump > 1/T). 
This longer time-scale gives better energy resolution than in a two-level 
atom and therefore allows cooling below the Doppler cooling limit, i.e. 
kpT ~ h/Tpump < AT /2. A specific example of this general argument is 
shown in Fig. 9.17, and the following section gives more details. 


9.7.2. Detailed description of Sisyphus cooling 


Consider an atom that has a lower level with angular momentum J = 1/2 
and an upper level with J' = 3/2 that moves through a standing wave 
formed by two counter-propagating laser beams with orthogonal linear 
polarizations (e.g. along €, and e,). The resultant polarization depends 
on the relative phase of the two laser beams and varies with position, 
as shown in Fig. 9.18(b). This polarization gradient causes the periodic 
modulation of the light shift of the states in the lower level. The strength 
of the interaction with the light depends on My and M, in the lower 
and upper levels, respectively. To understand this in detail, consider 
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Fig. 9.18 Details of the Sisyphus cooling mechanism. (a) The electric dipole transitions between two levels with angular 
momenta J = 1/2 and J’ = 3/2. The relative strength of each transition is indicated—this gives the relative intensity when the 
states in the upper level are equally populated (each state has the same radiative lifetime). (b) The polarization in a standing 
wave formed by two laser beams that propagate along €. and —e., and have orthogonal linear polarizations along éz and é,, 
respectively. The resultant electric field is circularly polarized (€, + ié,) /W2 at positions.where the two counter-propagating 
beams have a phase difference of +7/2. The polarization changes from ot to o~ over a distance of Az = \/4, and between 
these positions the light has elliptical or linear polarization. (c) The energies of the states at positions of @~ and o* polarization 
(the unperturbed energy of the lower level is shown as a dotted line). Absorption of the circularly-polarized light followed by 
spontaneous emission transfers the population into the state with lowest energy (largest light shift). (d) The light shift varies 
with position and the optical pumping process, outlined in (c), transfers atoms from the top of a hill to the bottom of a valley 
(as shown in Fig. 9.17); or at least this process in which atoms lose energy happens more often than the other way around. 
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Fig. 9.19 The intensities of the com- 
ponents of the J = 1/2 to J = 3/2 
transition are represented by a, b and 
c (as in Example 7.3) and their rel- 
ative values can be determined from 
sum rules. The sum of the intensities 
from each of the upper states is the 
same: a = b+ c; because normally the 
lifetime of an atom does not depend 
on its orientation. (A similar rule ap- 
plies to the states in the lower level but 
this does not yield any further informa- 
tion in this case.) When the states of 
the upper level are equally populated 
the atom emits unpolarized radiation; 
hence a +c = 2b. We now have two 
simultaneous equations whose solution 
i = 2a and c = 1a, so the relative 


3 
intensities area: b:c=3:2:1. 


55This is the same convention for de- 
scribing polarization that we used for 
the magneto-optical trap; ot and o7 
refer to transitions that the radiation 
excites in the atom, AM, = +1, re- 
spectively. In laser cooling we are 
mainly interested in determining what 
transitions occur, and this depends on 
the sense of rotation of the electric 
field around the quantization axis of the 
atom (as described in Section 2.2). As 
stated previously, the electric field of 
the radiation drives the bound atomic 
electron(s) around in the same sense 
as the electric field; the circularly- 
polarized radiation travelling parallel to 
the quantization axis that is labelled 
o+ imparts positive angular momen- 
tum to the atom. The handedness of 
the polarization can be deduced from 
this statement for a given direction of 
propagation, if necessary. 
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a position where the light has o+ polarization;°° here the interaction 
for the My = 1/2 to My = 3/2 transition is stronger than that for 
the My = —1/2 to My = 1/2. (The squares of the Clebsch-Gordan 
coefficients are 2/3 and 1/3, respectively, for these two transitions, as 
determined from the sum rules as shown in Fig. 9.19). For light with 
a frequency detuning to the red (6 < 0), both of the M, states in the 
lower level (J = 1/2) are shifted downwards; the M; = +1/2 state 
is shifted to a lower energy than the My; = —1/2 state. Conversely, 
at a position of o~ polarization the M; = —1/2 state is lower than 
the M; = 1/2 state. The polarization changes from o~ to ot over a 
distance of Az = X/4, so that the light shift varies along the standing 
wave, as shown in Fig. 9.18(d). An atom moving over these hills and 
valleys in the potential energy speeds up and slows down as kinetic and 
potential energy interchange, but its total energy does not change if it 
stays in the same state. 

To cool the atom there must be a mechanism for dissipating energy 
and this occurs through absorption and spontaneous emission—the pro- 
cess in which an atom absorbs light at the top of a hill and then decays 
spontaneously back down to the bottom of a valley has a higher proba- 
bility than the reverse process. Thus the kinetic energy that the atom 
converts into potential energy in climbing the hill is lost (taken away 
by the spontaneously-emitted photon); the atom ends up moving more 
slowly, at the bottom of a valley. This was dubbed the ‘Sisyphus’ effect 
after a character in Greek mythology who was condemned by the gods 


to repeatedly roll a stone to the top of a hill.°6 

To explain the transfer between the M, states of the lower level let 
us again consider the details of what happens at a particular position 
where the light has o* polarization (see Fig. 9.18(d)). Absorption of 
ot light excites an atom from the state Mj = -1/2 to My = 1/2. An 
atom in this excited state may decay to either of the lower M, states; 
if it returns to M; = —1/2 then the process restarts, but it may go into 
the M; = +1/2 state from which it cannot return (because o* light 
excites the transition from Mj = 1/2 to My = 3/2 and the excited 
state of this transition only decays to My = +1/2). Thus the o® light 
results in a one-way transfer My = —1/2 to My; = +1/2 (via an excited 
state). This process in which absorption of light transfers population 
into a given state is known as optical pumping.®” In Sisyphus cooling the 
optical pumping at a position of o* polarization takes an atom from 
the top of a hill, in the potential energy for the My = —1/2 state, and 
transfers it down into a valley of the potential energy for the Mj = 1/2 
state. The atom continues its journey in the M; = 1/2 state until it gets 
optically pumped back to My = —1/2 at a position of o~ polarization.°® 
In each transfer the atom loses an energy Up approximately equal to the 
height of the hills (relative to the bottom of the valleys). This energy is 
roughly equal to the light shift in eqn 9.46.°° 

This physical picture can be used to estimate the rates of cooling and 
heating in the Sisyphus mechanism—the balance between these gives 
the equilibrium temperature (cf. the treatment of Doppler cooling in 
Section 9.3).®° Such a treatment shows that atoms in a standing wave 
have a typical kinetic energy ~ Up. This suggests that the Sisyphus 
mechanism works until atoms can no longer climb the hills and remain 
stuck in a valley (cf. an optical lattice). This simple picture predicts 
that the temperature is related to the intensity and frequency detuning 
by 


ng 
eS Oa 


which is borne out by more detailed calculation. 


(9.53) 


9.7.3 Limit of the Sisyphus cooling mechanism 


In a typical optical molasses experiment there are the following two 
stages. Initially, the laser beams have a frequency several line widths 
below the atomic resonance (6 ~ —I) and intensities ~ Isa, to give a 
strong radiation force. Then the laser frequency is changed to be further 
from resonance (and the intensity may be reduced as well) to cool the 
atoms to lower temperatures below the Doppler limit. The initial stage 
of Doppler cooling, as described in Section 9.3, is essential because the 
sub-Doppler cooling mechanisms only operate over a very narrow range 
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°6In addition to Sisyphus cooling, 
Dalibard and Cohen-Tannoudji found 
another sub-Doppler cooling mecha- 
nism called motion-induced orientation. 
This mechanism leads to an imbalance 
in scattering from counter-propagating 
beams that is much more sensitive to 
velocity, and hence produces a stronger 
damping than the imbalance caused by 
the Doppler effect in the ‘ordinary’ op- 
tical molasses technique. In a standing 
wave made from beams of opposite cir- 
cular polarization (at to o—), a sta- 
tionary atom has the population dis- 
tributed over the magnetic sub-levels of 
the ground state in a symmetrical way, 
so that there is equal scattering from 
each beam and no net force. An atom 
moving through a gradient of polariza- 
tion, however, sees a changing direction 
of the electric field and this causes a 
change in the distribution over the sub- 
levels (orientation by optical pumping) 
leading to a difference in the proba- 
bility of absorption from each beam. 
In real optical molasses experiments, 
the three mutually-orthogonal pairs of 
laser beams create a complex three- 
dimensional pattern of polarization and 
a combination of sub-Doppler cooling 
mechanisms takes place. 

57 Optical pumping in atomic vapours 
at room temperature was used for 
very precise radio-frequency spectro- 
scopy even before the laser was in- 
vented, e.g. to measure the splitting 
between the Zeeman sub-levels as de- 
scribed in Thorne (1999) and Cor- 
ney (2000). 


58-The atom may travel over many hills 
and valleys between excitations, and 
the absorption and emission does not 
always remove energy, but averaged 
over many events this stochastic pro- 
cess dissipates energy. 


59 Actually, it is about two-thirds of 
the light shift for the case shown in 
Fig. 9.17. 


oe Tine heating arises from fluctuations 
in the dipole force—the direction of this 
force changes as an atom jumps from 
one M, state to another, See Metcalf 
and. van der Straten (1999) for a quan- 
titative treatment. 
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61 Broadly speaking, in Sisyphus cool- 
ing the force averages to zero for atoms 
that travel over many hills and val- 
leys in an optical-pumping time. Thus 
this mechanism works for velocities v 
such that vtpump S A/2. This veloc- 
ity range is less than the capture ve- 
locity for Doppler cooling by the ratio 
T/Tpump- 


62This assumes that each degree of 
freedom has energy 

sheT: =e (9.54) 
63 Teavy alkalis such as Cs and Rb have 
a very low recoil limit and these el- 
ements can be laser cooled to a few 
pK. Such temperatures can only be 
achieved in practice when stray mag- 
netic fields that would perturb the Mp 
states are carefully controlled—a Zee- 
man shift 43B comparable with the 
light shift Uo will affect the Sisyphus 
cooling mechanism, i.e. if up B ~ Uo. 
For Uo/kp = 3 uK this criterion implies 
that B < 5 x 1075, which is an or- 
der of magnitude less than the Earth’s 
magnetic field (5 x 10-4 T). 


64-The assumption that the distribution 
has a Gaussian shape becomes worse 
at the lowest velocities of only a few 
times the recoil velocity—the small- 
est amount by which the velocity can 
change. Commonly, the distribution 
develops a sharp peak around v = 0 
with wide wings. In such cases the 
full distribution needs to be specified, 
rather than a single parameter such as 
the root-mean-square velocity, and the 
notion of a ‘temperature’ may be mis- 
leading. This remark is even more rel- 
evant for cooling below the recoil limit, 
as described in the following section. 


85In a high-intensity standing wave, a 
combination of the dipole force and 
spontaneous scattering dissipates the 
energy of a two-level atom, as shown by 
Dalibard and Cohen-Tannoudji (1985). 
This high-intensity Sisyphus mecha- 
nism damps the atomic motion for a 
frequency detuning to the blue (and the 
opposite for the low-intensity effect), 
and the hills and valleys in the poten- 
tial energy arise directly from the varia- 
tion in intensity, as in an optical lattice, 
rather than a gradient of polarization. 


of velocities.*? 

The equilibrium temperature in sub-Doppler cooling does not decrease 
indefinitely in proportion to I/|6|. Sisyphus cooling stops working when 
the loss in energy in going from the top of a hill (in the potential energy) 
to the valley bottom is balanced by the recoil energy acquired in sponta- 
neous emission, Up ~ E,. For this condition there is no net loss of energy 
in optical pumping between M, states. Thus the lowest temperatures 
achieved are equivalent to a few times the recoil energy, T ~ E,/kg. At 
this recoil limit®? the temperature is given by 


ee, 2 
ck oo (9.55) 


For sodium the temperature at the recoil limit is only 2.4uK (cf. Tp = 
240 vK). Typically, the optical molasses technique can reach tempera- 
tures that are an order of magnitude above the recoil limit, but still well 
below the Doppler cooling limit.°* 

The meaning of temperature must be considered carefully for dilute 
gas clouds. In a normal gas at room temperature and pressure the in- 
teratomic collisions establish thermal equilibrium and give a Maxwell- 
Boltzmann distribution of velocities. A similar Gaussian distribution 
is often obtained in laser cooling, where each atom interacts with the 
radiation field independently (for moderate densities, as in the opti- 
cal molasses technique) and an equivalent temperature can be assigned 
that characterises the width of this distribution (see eqn 8.3).6* From 
the quantum point of view, the de Broglie wavelength of the atom 
is more significant than the temperature. At the recoil limit the de 
Broglie wavelength roughly equals the wavelength of the cooling radi- 
ation, AaB ~ Aight, because the atomic momentum equals that of the 
photons (and for both, \ = h/p is the relationship between wavelength 
and momentum p). 

This section has described the Sisyphus cooling that arises through a 
combination of the spatially-varying dipole potential, produced by the 
polarization gradients, and optical pumping. It is a subtle mechanism 
and the beautifully-detailed physical explanation was developed in re- 
sponse to experimental observations. It was surprising that the small 
light shift in a low-intensity standing wave has any influence on the 
atoms.°° The recoil limit is an important landmark in laser cooling and 
the next section describes a method that has been invented to cool atoms 
below this limit. 


9.8 Raman transitions 


9.8.1 Velocity selection by Raman transitions 


Raman transitions involve the simultaneous absorption and stimulated 
emission by an atom. This process has many similarities with the two- 
photon transition described in Section 8.4 (see Appendix E). A coherent 
Raman transition between two levels with an energy difference of hwy» 


is illustrated in Fig. 9.20. For two beams of frequencies wp; and wy2 the 
condition for resonant excitation is 


U 
Wri + kyu — (Wye - kav) = Wi — WL + 2 (wr1 ap WL2) = W412. (9.56) 


For counter-propagating beams the Doppler shifts add to make the Ra- 
man transition sensitive to the velocity—about twice as sensitive as a 
single-photon transition.®* Direct excitation of the transition by radio- 
frequency radiation, or microwaves, at angular frequency w 2 is insen- 
sitive to the motion. The great advantage of the Raman technique for 
velocity selection (and cooling) arises from its extremely narrow line 
width (comparable with that of radio-frequency methods) of Raman 
transitions between levels that have long lifetimes, e.g. hyperfine levels 
in the ground configuration of atoms for which spontaneous decay is 
negligible. To fully exploit the advantage of this narrow line width, the 
difference in frequency between the two laser beams Aw = wy — wy2 
must be controlled very precisely. This can been achieved by taking 
two independent lasers and implementing sophisticated electronic servo- 
control of the frequency difference between them, but it is technically 
easier to pass a single laser beam through a phase modulator—the resul- 
tant frequency spectrum contains ‘sidebands’ whose difference from the 
original laser frequency equals the applied modulation frequency from a 
microwave source.®” The selected velocity v is determined by 


2kv = wy2 — (WL — Wi2); (9.57) 


where k = (wy + wy2) /¢c is the mean wavevector. 
Raman transitions between levels with negligible broadening from 
spontaneous decay or collisions have a line width determined by the 
interaction time: for a pulse of duration Tpulse the Fourier transform 

limit. gives®® 
Av cs il 


A Tpulse 


(9.58) 


For a visible transition with a wavelength of 600nm®® a pulse of duration 


9.8 Raman transitions 209 


Fig. 9.20 A Raman transition between 
levels 1 and 2 driven by two laser beams 
of (angular) frequencies wy; and wyp2. 
For a resonant Raman process the fre- 
quency detuning 6 ~ O, and the de- 
tuning A from the intermediate state 
remains large, so that excitation by 
single-photon absorption is negligible 
in comparison to the coherent transfer 
from |1) to |2). In this example the 
atom has velocity v along the direction 
of the laser beam with frequency wy, 
and the laser beam with frequency wy 
propagates in the opposite direction. 


661 contrast, two counter-propagating 
laser beams of the same frequency give 
Doppler-free two-photon spectra: 


wp +kv+ (wy — kv) = 2wz, 


as in eqn 8.20. If a two-photon transi- 
tion is excited by two laser beams with 
different frequencies then the Doppler 
shifts do not cancel exactly. 


67 For a laser beam with (angular) fre- 
quency w, phase modulation at fre- 
quency 2 leads to a spectrum contain- 
ing the frequencies w + nQ, with n in- 
teger. This can be used to carry out 
Raman excitation, e.g. with wp, = w 
and wpg =w—. 


68cimilar to that for the single-photon 
transition in Section 7.1.2. 


69 as in the case of sodium that was 
used in the first Raman experiments 
with cold atoms. In sodium, levels 
1 and 2 are the hyperfine levels with 
F = 1 and 2 of the 3s configuration, 
and the intermediate level i is 3p ?P3/2. 
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7A useful comparison can be made 
with the method of reducing the 
Doppler broadening shown in Fig. 8.2, 
in which a narrow slit is used to colli- 
mate an atomic beam and so reduce the 
spread of transverse velocities. 


71 Rquivalently, the atom’s momentum 
changes by hki — hko ~ 2hk. (Here 
wi2 Kw Ww.) 


72 Some atoms fall back into level 2 and 
are excited again until eventually they 
end up in level 1. Atoms that un- 
dergo more than one excitation receive 
additional impulses from the absorbed 
and emitted photons, which reduces the 
cooling efficiency but does not affect 
the principle. 


3 For velocity selection of atoms with 
v < 0 the direction of the beams is 
reversed so that these atoms are dis- 
tributed into the range v to v+2v, (that 
includes v = 0). 


Tpulse = 600 ps selectively transfers atoms in a range of width Av ~ 
lmms~!. This is about thirty times less than v, and equivalent to a 
‘temperature’ of T,/900 for the motion along the axis of the laser beams. 
This velocity selection does not produce any more cold atoms than at 
the start—it just separates the cold atoms from the others—so it has a 
different nature to the laser cooling processes described in the previous 
sections. 


9.8.2 Raman cooling 


The previous section showed that Raman transitions give the precision of 
radio-frequency spectroscopy combined with a sensitivity to the Doppler 
shift twice that of single-photon (optical) transitions. Raman cooling 
exploits the extremely high velocity resolution of coherent Raman tran- 
sitions to cool atoms below the recoil limit. The complete sequence of 
operations in Raman cooling is too lengthy to describe here, but the 
important principle can be understood by considering how atoms with 
a velocity distribution that is already below the recoil limit are cooled 
further. Figure 9.21 shows such an initial distribution in level 1 (the 
lower hyperfine level in the ground configuration of the atom; level 2 is 
the upper hyperfine level). Raman cooling uses the following steps. 


(a) Velocity selection by a Raman pulse that transfers atoms with ve- 
locities in the range from v — Av/2 to v+ Av/2 up to level 2, where 
they have velocities centred about v — 2u,. (The process of absorp- 
tion and stimulated emission in the opposite direction changes the 
atom’s velocity by 2v,.”*) 

(b) Atoms in level 2 are excited to level 7 by another laser beam and can 
decay back to level 1 with velocities centred around v — v, (includ- 
ing the change in velocity produced by absorption). Spontaneous 
emission goes in all directions so that the atoms return to level 1 
with velocities anywhere in the range v to v — 2v,.? 


It might appear that this cycle of a velocity-selective Raman pulse 
followed by repumping has made things worse since the final spread 
of velocities is comparable to, if not greater than the initial spread. 
Crucially, however, some atoms fall back into level 1 with velocities 
very close to zero so the number of very slow atoms has increased, and 
increases further for each repetition of the cycle with different initial 
velocity.’? Precise control of the Raman pulses ensures that atoms with 
velocities in the narrow range —dv < v < dv are never excited, so that 
after many cycles a significant fraction of the population accumulates in 
this narrow velocity class with dv < v,. In this Raman cooling process 
the atomic velocity undergoes a random walk until either it falls into the 
desired velocity class around v = 0, and remains there, or diffuses away 
to higher velocities. The recoil limit is circumvented because atoms 
with v ~ 0 do not interact with the light and this sub-recoil cooling 
mechanism does not involve a radiation force (in contrast to the Doppler 
and sub-Doppler cooling mechanisms described in previous sections). 
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—vur 0 » v 


More atoms 
in range 
—dvu to dv 


9.9 An atomic fountain 211 


(d) 
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Fig. 9.21 One step in the sequence of operations in Raman cooling. (a) Velocity selection by a Raman pulse that transfers 
atoms that have velocities within a certain narrow range from |1) to |2)—the process of absorption and stimulated emission 
changes the atomic velocity by —2v,. (b) Atoms are excited from level 2 to level i by another laser beam—in this process the 
atomic velocity changes by vr. (c) Atoms decay to level 1 by spontaneous emission—the recoil in a random direction means 
that the atoms return to level 1 with a component of velocity anywhere in the range v to v — 2u;, where v is the initial velocity. 
There are more atoms in the narrow velocity class around v = 0 than at the start of the sequence. (d) Repetition of the 
sequence with different initial velocities increases the number of atoms with v ~ 0 until they are ‘piled’ up in a distribution 


whose width is much less than the recoil velocity vy. 


The time taken for atoms to fall (randomly) into a velocity class of 
width 26v increases as du decreases and this determines the final velocity 
spread achievable by Raman cooling in practice. 

Raman cooling works well in one dimension, but it is much less effi- 
cient in three dimensions where the target is to have all three components 
Vz, Vy and v, between tdv. Another method of sub-recoil cooling called 
velocity-selective coherent population trapping is also a stochastic pro- 
cess, see Metcalf and van der Straten (1999) and Bardou et al. (1991). 
Raman transitions are also used for matter-wave interferometry based 
on ultra-cold atoms (Chapter 10). 


9.9 An atomic fountain 
The slow atoms produced by laser cooling have led to a dramatic im- 


provement in measurements whose resolution is limited by the interac- 
tion time. Cold atoms can be confined in dipole-force traps’* for long 


40Or magnetic traps as described in the 
next chapter. 
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Pin principle, the perturbation can be 
calculated and corrected for, but with- 
out perfect knowledge of the trapping 
potential this leaves a large uncertainty. 
There are currently proposals for fre- 
quency standards based on transitions 
in optically-trapped atoms for which 
the light shift cancels out, i.e. the lower 
and upper levels of the narrow transi- 
tion have very similar light shifts. 


76Tn such an atomic fountain the 
atoms have an initial velocity of vz = 
(29h)1/2 = 4ms7!. 


7’ Caesium has a resonance wavelength 
of 852 nm and a relative atomic mass of 
S38), 


8 Only the spread in the radial direc- 
tion leads to a loss of atoms, so velocity 
selection in two dimensions by Raman 
transitions, or otherwise, is useful. 


79 At the extraordinary precision of 
these experiments, collisions between 
ultra-cold caesium atoms cause an ob- 
servable frequency shift of the hyperfine 
transition (proportional to the density 
of the atoms). Therefore it is undesir- 
able for this density to change during 
the measurement. 


periods of time; however, the trapping potential strongly perturbs the 
atomic energy levels and hinders accurate measurements of the transi- 
tion frequencies.”> The highest-accuracy measurements use atoms in free 
fall, as shown in Fig. 9.16. This apparatus launches cold atoms upwards 
with velocities of a few ms~!, so that they travel upwards for a short dis- 
tance before turning around and falling back down under gravity—this 
forms an atomic fountain. 

A particularly important use of atomic fountains is to determine the 
frequency of the hyperfine-structure splitting in the ground configura- 
tion of caesium since this is used as the primary standard of time. Each 
atom passes through a microwave cavity on the way up and again on 
its way down, and these two interactions separated in time by T lead 
to Ramsey fringes (Fig. 7.3) with frequency width Af = 1/(2T), as 
described in Section 7.4. Simple Newtonian mechanics shows that a 
fountain of height h = 1m gives T = 2(2h/g)'/? ~ 1s, where g is the 
gravitational acceleration.’° This is several orders of magnitude longer 
than the interaction time for a thermal atomic beam of caesium atoms 
(Section 6.4.2). This is because the measurement time on Earth is lim- 
ited by gravity, and an obvious, but not simple, way to obtain further 
improvement is to put an apparatus into space, e.g. aboard a satellite 
or space station in orbit. Such an apparatus has the same components 
as an atomic fountain, but the atoms only pass once through the mi- 
crowave interaction region and are detected on the other side—pushing 
the atoms gently so that they move very slowly through the microwave 
cavity gives measurement times exceeding 10s. 

Cold atoms obtained by laser cooling are essential for both the atomic 
fountain and atomic clocks in space, as shown by the following estimate 
for the case of a fountain. The entrance and exit holes of the microwave 
cavity have a diameter of about lcm. If the atoms that pass through the 
cavity on the way up have a velocity spread about equal to the recoil ve- 
locity v; = 3.5mms~! for caesium,” then the cloud will have expanded 
by ~ 4mm by the time it falls back through the cavity.”* Thus a reason- 
able fraction of these atoms, that have a temperature close to the recoil 
limit, pass back through the cavity and continue down to the detection 
region. Clearly, for a 10s measurement time the effective temperature 
of the cloud needs to be well below the recoil limit.”? These general con- 
siderations show the importance of laser cooling for the operation of an 
atomic fountain. Some further technical details are given below. 

The atoms are launched upwards by the so-called ‘moving molasses’ 
technique, in which the horizontal beams in the six-beam configura- 
tion shown in Fig. 9.5 have angular frequency w, and the upward and 
downward beams have frequencies w + Aw and w — Aw, respectively. 
In a reference frame moving upwards with velocity v = (Aw/k)é, the 
Doppler shift is Aw, so that all the beams appear to have the same 
frequency. Therefore the optical molasses mechanisms damp the atomic 
velocity to zero with respect to this moving frame. These atoms have 
the same velocity spread about their mean velocity as atoms in the op- 
tical molasses technique with a stationary light field (Aw = 0), so the 


temperature is the same in both cases.®° 


In an atomic fountain the scheme for detecting that a microwave tran- 
sition has occurred is very different to that in an atomic beam (Sec- 
tion 6.4). The ground configuration of caesium has J = 1/2 (like all 
alkalis) and the two hyperfine levels are F = 3 and 4 (for the only sta- 
ble isotope that has nuclear spin J = 7/2). If the atoms start in the 
lower level F = 3 then the microwave radiation transfers a fraction of 
the atoms to the F = 4 level. This fraction is determined when the 
atoms fall through a laser beam that detects atoms in the F = 4 level, 
by exciting a transition from this level and monitoring the fluorescence, 
see Fig. 9.16. (Atoms in the F = 3 level pass through undetected.*!) 
Figure 7.3 shows a plot of the transition probability between the hyper- 
fine levels as a function of the microwave frequency—so-called Ramsey 
fringes. The narrow line width means that the frequency of the mi- 
crowave source used to drive the transition can be set very precisely 
to the caesium hyperfine frequency. Such an apparatus maintains the 
frequency of the microwave source stable to better than 1 part in 101°, 
or 32ns per year. Many causes of perturbations that might give fre- 
quency shifts are small because of the atoms’ low atomic velocity, but 
the Zeeman effect of magnetic fields remains a limitation. Experiments 
use the F = 3, Mp = Oto F = 4, Mp = O transition because states with 
Mr = 0 have no first-order Zeeman shift. Nowadays, such caesium foun- 
tain frequency standards play an important role in guiding the ensemble 
of clocks in national standards laboratories around the world that give 
agreed Universal Time.®” 


9.10 Conclusions 


The techniques that have been developed to reduce the temperature of 
atoms from 1000K to well below 1K have had an enormous impact 
on atomic physics. Laser cooling has made it possible to manipulate 
neutral atoms in completely new ways and to trap them by magnetic and 
dipole forces. Some important applications of atom trapping have been 
mentioned, such as the great improvement in precision measurements, 
and others are given in later chapters, e.g. Bose-Einstein condensation 
and the laser cooling of trapped ions. 

The important principles of radiation forces have been discussed, 
namely: the way in which the scattering force dissipates the energy 
of atoms and cools them to the Doppler cooling limit; the trapping of 
atoms by the dipole force in various configurations including optical lat- 
tices; and sub-Doppler cooling by the Sisyphus mechanism and sub-recoil 
cooling. This chapter greatly simplifies the real story of laser cooling for 
the sake of a clear presentation; the book Laser cooling and trapping of 
atoms by Metcalf and van der Straten (1999) gives a more comprehen- 
sive description of the important contributions made by many people 
and many references to other material—see also the review by Wieman 


9.10 Conclusions 213 


80-These caesium atoms have a veloc- 
ity spread of about 30; ~ 10mms7!?. 
These atoms could be used directly if 
the measurement time is 0.38, but there 
would be a large loss of atoms in a 
higher fountain with T = 1s. 


81-T > normalise the signal the atoms in 
the F = 3 level are detected with a sec- 
ond probe laser beam (not shown in the 
figure). 


82The important uses of such clocks 
were given in Section 6.4.2 and up-to- 
date information can be found on the 
web sites of national standards labora- 
tories. 
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et al. (1999). Various internet resources and popular descriptions can 
be found on the web site of the Nobel foundation. 
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Exercises 


More advanced problems are indicated by a *. 


(9.1) Radiation pressure 


(9.2 


) 


ee 


What force does radiation exert on the head of a 
person wearing a black hat of radius 15cm when 
the sun is directly overhead. Estimate the ratio of 
this radiation force to the weight of the hat. 


An argument for photon momentum (due to 
Enrico Fermi) 

An atom moving at velocity v absorbs a pho- 
ton propagating in the opposite direction (as in 
Fig. 9.1). In the laboratory frame of reference the 
photon has (angular) frequency w and momentum 
Gph- In the rest frame of the atom the photon has 
(angular) frequency wo, where hwo = E2 — Ej is 
the energy of the (narrow line width) transition 
between levels 1 and 2. After the absorption the 
system has a total energy of 4M(v—Av)* + hwo. 


(a) Write down the equations for conservation of 
energy and momentum. 

(b) Expand the equation for conservation of en- 

ergy, neglecting the term of order (Av)”. (The 

change in velocity Av is small compared to v.) 

(c) Use the usual expression for the fractional 

Doppler shift (w — wo)/w = v/c to find an ex- 


pression for the photon momentum gph. 


Heating from photon recoil 

This exercise is based on a treatment of laser cool- 
ing by Wineland and Itano (1979). The angular 
frequencies of radiation absorbed and emitted by 
an atom are given by 


1 2 £8 
Wabs = Wo + Kabs - v — swo() +=, 

7 G h 

a v'\2 #E, 
len: =o Kem = 50\ aaa 


where |Kaps| = wabs/c and |Kem| = Wem/e are the 
wavevectors of the absorbed and emitted photons, 
respectively, v is the velocity of the atom before 
the photon is absorbed, and similarly v’ is the ve- 
locity of the atom before emission. Prove these re- 
sults from conservation of (relativistic) energy and 


(9.4) 


(9.5) 


(9.6) 


(9.7) 


momentum (keeping terms of order (v/c)? in the 
atomic velocity and E,/hwo in the recoil energy). 
Averaged over many cycles of absorption and emis- 
sion, the kinetic energy of the atom changes by 


AEF xe = R(Wabs a Wem) = Akaps > V+ 2E 


for each scattering event. Show that this result 
follows from the above equations with certain as- 
sumptions, that should be stated. Show that, 
when multiplied by the scattering rate Recatt, the 
terms hkaps - v and 2E, give cooling and heating 
at comparable rates to those derived in the text 
for the optical molasses technique. 


The angular momentum of light 

An atom in a ‘Spo level is excited to a state with 
L =1, Mr = 1 by the absorption of a photon (a 
o* transition). What is the change in the atomic 
angular momentum? 

A laser beam with a power of 1W and a wave- 
length of 600 nm passes through a waveplate that 
changes the polarization of the light from linear to 
circular. What torque does the radiation exert on 
the waveplate? 


Slowing H and Cs with radiation 

Atomic beams of hydrogen and caesium are pro- 
duced by sources at 300 K and slowed by counter- 
propagating laser radiation. In both cases calcu- 
late (a) the stopping distance at half of the max- 
imum deceleration, and (b) compare the Doppler 
shift at the initial velocity with the natural width 
of the transition. (Data are given in Table 9.1.) 


The Doppler cooling and recoil limits 
Calculate the ratio Tp/T, for rubidium (from 
eqns 9.28, 9.55 and the data in Table 9.1). 


Damping in the optical molasses technique 


(a) For the particular case of a frequency detuning 
of 6 = —I’/2 the slope of the force versus ve- 
locity curve, shown in Fig. 9.6, at v = 0 equals 
the peak force divided by I'/(2k). Use this to 


(9.8) 


(9.9) 


estimate OF /Ov and hence to determine the 
damping coefficient a for an atom in a pair of 
counter-propagating laser beams, under these 
conditions. 


(b) Estimate the damping time for a sodium atom 
in the optical molasses technique when each 
laser beam has intensity 2Jsa, and 6 = —I'/2. 


Laser cooling of a trapped ion 

A trapped Ca* ion undergoes simple harmonic 
motion with an oscillation frequency of 2. = 27 x 
100kHz. The ion experiences a radiation force 
from laser light of wavelength 393nm and inten- 
sity J that excites a transition with T = 27 x 23 x 
10°s~*. The frequency detuning 5 does not de- 
pend on the ion’s position within the trap. 


(a) Show that the force on the ion has the form 
F = —k(z — 2) — av. Describe the ion’s mo- 
tion. 


(b) Find the static displacement zo of the ion from 
the centre of the harmonic potential, along the 
direction of the laser beam, for 6 = —I'/2 and 
IE = ileate 


(c) Show that, to a good approximation, the 
damping coefficient can be written in the form 
ry 

a x son (9.59) 
where the variables x and y are proportional 
to 6 and J, respectively. Maximise this func- 
tion of two variables and hence determine the 
intensity and frequency detuning that give the 

maximum value of a. 


(d) The kinetic energy of small oscillations about 
zo decays with a damping time of Taamp = 
M/a. Show that this damping time is in- 
versely proportional to the recoil energy.*° 
Evaluate this minimum value of Tgamp for a 
calcium ion of mass M ~ 40 a.m.u. 


Comment. This treatment of Doppler cooling for 
a single laser beam is accurate for any intensity 
(even above Isat), whereas the approximation that 
two laser beams (as in the optical molasses tech- 
nique) give twice as much damping as a single 
beam is not accurate at high intensities. 


The properties of a magneto-optical trap 


(a) Obtain an expression for the damping co- 
efficient a for an atom in two counter- 
propagating laser beams (each of intensity J), 
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taking into account saturation. (Use the re- 
sults of the previous exercise with the modifi- 
cation I — 2] in both the numerator and de- 
nominator, or otherwise.) Determine the min- 
imum damping time (defined in eqn 9.19) of 
a rubidium atom in the optical molasses tech- 
nique (with two laser beams). 


(b) The force on an atom in an MOT is given by 
eqn 9.30. Assume the worst-case scenario in 
the calculation of the damping and the restor- 
ing force, along a particular direction, i.e. that 
the radiation force arises from two counter- 
propagating laser beams (each of intensity I) 
but the saturation of the scattering rate de- 
pends on the total intensity 6/ of all six laser 
beams. Show that the damping coefficient can 
be written in the form 


a 
(1+y+22)?’ 

where x = 26/T and y = 61 /Isat. Using the 
results of the previous exercise, or otherwise, 
determine the nature of the motion for a ru- 
bidium atom in an MOT with the values of 
I and 6 that give maximum damping, and a 
field gradient 0.1T m7? (in the direction con- 
sidered). 


(9.10) Zeeman slowing in a magneto-optical trap 


(a) Instead of the optimum magnetic field profile 
given in eqn 9.11, a particular apparatus to 
slow sodium atoms uses a linear ramp 


z 

B(z) = Bo @ 2 =) 

for 0 < z < L, and B(z) = 0 outside this 

range. Explain why a suitable value for Bo 

is the same as in eqn 9.12. Show that the 

minimum value of L is 2Lo, where Lp is the 
stopping distance for the optimum profile. 


(b) The capture of atoms by a magneto-optical 
trap can be considered as Zeeman slowing in 
a uniform magnetic field gradient, as in part 
(a). In this-situation the maximum veloc- 
ity captured by an MOT with laser beams 
of radius 0.5cm is equivalent to the velocity 
of atoms that come to rest in a distance of 


83Surprisingly, the damping time does not depend on the line width of the transition [., but narrow transitions lead to a 
small velocity capture range. 


216 Laser cooling and trapping 


Lo = 0.25cm for constant deceleration at half Show that the equilibrium number of atoms in 
the maximum value. Use this simple model of the MOT is independent of the vapour pres- 
a trap to calculate the capture velocity for ru- sure. 

bidium atoms. What is a suitable value for the (c) Atoms enter the trapping region over a sur- 
gradient of the magnetic field, Bo/L (where facearen A=6D?. Au MOU with D —2cn 
L = 210)? (Data are given in Table 9.1.) has ve ~ 25ms~! for rubidium. Make a rea- 


sonable estimate of the cross-section o for col- 
lisions between two atoms and hence find the 
equilibrium number of atoms captured from a 


Comment. The magnetic field gradients in a 
magneto-optical trap are much less than those in 
magnetic traps (Chapter 10), but the force (from 


pee : t ture.°4 
the radiation) is much stronger than the magnetic low Bese Ute Ne Ot OC ee 
force. (9.12) Optical absorption by cold atoms in an MOT 
(9.11) The equilibrium number of atoms in an MOT In a simplified model the trapped atoms are con- 
The steady-state number of atoms that congregate sidered as a spherical cloud of uniform density, ra- 
at the centre of an MOT is determined by the bal- dius r and number N. 


ance between the loading rate and the loss caused 
by collisions. To estimate this equilibrium number 
N, we consider the trapping region formed at the 
intersection of the six laser beams of diameter D 
as being approximately a cube with sides of length AI | No(w) 


D. This trapping region is situated in a cell filled i, = amore 
with a low-pressure vapour of number density NV. 


(a) Show that a laser beam of (angular) frequency 
w that passes through the cloud has a frac- 
tional change in intensity given by 


(The optical absorption cross-section o(w) is 
(a) The loading rate can be estimated from the given by eqn 7.76.) 
kinetic theory expression }NvAf(v) for the (b) 


Absorption will significantly affect the oper- 
rate at which atoms with speed v hit a sur- P 2 7 E 


; ; : ation of the trap when AJ ~ Ip. Assuming 
on of at A in a Bas; (v) is the ne that this condition limits the density of the 
: ae a al ie = ee dD be peo cloud for large numbers of atoms,®° estimate 
(eqn 8.3). Integrate this rate from v = 0 up the radius and density for a cloud of V = 10° 


te ple capuate velocity oe OE ietues SxPres- rubidium atoms and a frequency detuning of 
sion for the rate at which the MOT captures ae 


atoms from the vapour. (The integration can 
be made simple by assuming that ve < vp.) (9.13) Laser cooling of atoms with hyperfine structure 


(b) Atoms are lost (‘knocked out of the trap’) by The treatment of Doppler cooling given in the text 
collisions with fast atoms in the vapour at a assumes a two-level atom, but in real experiments 
rate N = —Nvo, where 0 is the mean velocity with the optical molasses technique, or the MOT, 
in the vapour and a is a collision cross-section. the hyperfine structure of the ground configuration 


causes complications. This exercise goes through 


°4The equilibrium number in the trap is independent of the background vapour pressure over a wide range, between (i) the 
pressure at which collisions with fast atoms knock cold atoms out of the trap before they settle to the centre of the trap, and (ii) 
the much lower pressure at which the loss rate from collisions between the cold atoms within the trap itself becomes important 
compared to collisions with the background vapour. 

85 Comment. Actually, absorption of the laser beams improves the trapping—for an atom on the edge of the cloud the light 
pushing outwards has a lower intensity than the unattenuated laser beam directed inwards —however, the spontaneously-emitted 
photons associated with this absorption do cause a problem when the cloud is ‘optically thick’, i.e. when most of the light is 
absorbed. Under these conditions a photon emitted by an atom near the centre of the trap is likely to be reabsorbed by another 
atom on its way out of the cloud—this scattering within the cloud leads to an outward radiation pressure (similar to that in 
stars) that counteracts the trapping force of the six laser beams. (The additional scattering also increases the rate of heating 
for a given intensity of the light.) In real experiments, however, the trapping force in the MOT is not spherically symmetric, 
and there may be misalignments or other imperfections of the laser beams, so that for conditions of high absorption the cloud 
of trapped atoms tends to become unstable (and may spill out of the trap). 


(9.14) 


(9.15) 


some of the nitty-gritty details and tests under- 
standing of hyperfine structure.*° 


(a) Sodium has a nuclear spin I = 3/2. Draw an 
energy level diagram of the hyperfine structure 
of the 3s 7S, 2 and 3p ’P37» levels and indicate 
the allowed electric dipole transitions. 


In a laser cooling experiment the transition 
35 or F = 2 to 3p “Ps, F' = 3 is ex- 
cited by light that has a frequency detuning 
of 6 = —I'/2 ~ —5 MHz (to the red of this 
transition). Selection rules dictate that the 
excited state decays back to the initial state, 
so there is a nearly closed cycle of absorption 
and spontaneous emission, but there is some 
off-resonant excitation to the F’ = 2 hyperfine 
level which can decay to F = 1 and be ‘lost’ 
from the cycle. The F’ = 2 level lies 60 MHz 
below the F’ = 3 level. Estimate the aver- 
age number of photons scattered by an atom 
before it falls into the lower hyperfine level of 
the ground configuration. (Assume that the 
transitions have similar strengths.) 


To counteract the leakage out of the laser 
cooling cycle, experiments use an additional 
laser beam that excites atoms out of the 
3s *Sij2, F = 1 level (so that they can get 
back into the 3s solos F = 2 level). Sug- 
gest a suitable transition for this ‘repumping’ 
process and comment on the light intensity re- 
quired.®” 


The gradient force 

Figure 9.11 shows a sphere, with a refractive index 
greater than the surrounding medium, that feels 
a force towards regions of high intensity. Draw 
a similar diagram for the case Nsphere < Nmedium 
and indicate forces. (This object could be a small 
bubble of air in a liquid.) 


Dipole-force trap 
A laser beam®® propagating along the z-axis has 
an intensity profile of 


= on oan (-3) (9.60) 


(9.16) 


(9.17) 


Exercises for Chapter 9 217 


where r? = x” + y” and the waist size is w(z) = 
wo (ime /b7) with b = mwé/d”. This laser 
beam has a power of P = 1 W at a wavelength of 
A = 1.06 wm, and a spot size of wo = 10 pm at the 
focus. 


(a) Show that the integral of I(r,z) over any 
plane of constant z equals the total power 
of the beam P. 


Calculate the depth of the dipole potential 
for rubidium atoms, expressing your answer 
as an equivalent temperature. 


(b) 


(c) For atoms with a thermal energy much lower 
than the trap depth (so that r? < wé and 
z? <b’), determine the ratio of the size of 
the cloud in the radial and longitudinal di- 


rections. 


+(d) Show that the dipole force has a maximum 
value at a radial distance of r = wo/2. Find 
the maximum value of the waist size wo for 
which the dipole-force trap supports rubid- 
ium atoms against gravity (when the laser 


beam propagates horizontally). 


An optical lattice 

In a standing wave of radiation with a wavelength 
of A = 1.06 wm, a sodium atom experiences a peri- 
odic potential as in eqn 9.52 with Up = 100E,, 
where E, is the recoil energy (for light at the 
atom’s resonance wavelength \ = 0.589 ym). Cal- 
culate the oscillation frequency for a cold atom 
trapped near the bottom of a potential well in 
this one-dimensional optical lattice. What is the 
energy spacing between the low-lying vibrational 
levels? 


The potential for the dipole force 
Show that the force in eqn 9.43 equals the gradient 
of the potential 


Wace — ——— 


For what conditions does eqn 9.46 give a good ap- 
proximation for Uaipole? 


86The transfer between different hyperfine levels described here is distinct from the transfer between different Zeeman sub- 
levels (states of given Mz or Mj) in the Sisyphus effect. 

87In the Zeeman slowing technique the magnetic field increases the separation of the energy levels (and also uncouples the 
nuclear and electronic spins in the excited state) so that ‘repumping’ is not necessary. 

88 This is a diffraction-limited Gaussian beam. 
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1There was early work on magnetic 
trapping of ultra-cold neutrons whose 
magnetic moment is only —1.9 un, and 
the nuclear magneton py, which is 
much smaller than the Bohr magneton 


UB: 


Magnetic trapping, 
evaporative cooling and 
Bose-Einstein 
condensation 


Magnetic traps are used to confine the low-temperature atoms produced 
by laser cooling. If the initial atomic density is sufficiently high, the 
simple but extremely effective technique of evaporative cooling allows 
experiments to reach quantum degeneracy where the occupation of the 
quantum states approaches unity. This leads either to Bose-Einstein 
condensation (BEC) or to Fermi degeneracy, depending on the spin of 
the atoms. This chapter describes magnetic traps and evaporative cool- 
ing, using straightforward electromagnetism and kinetic theory, before 
giving an outline of some of the exciting new types of experiments that 
have been made possible by these techniques. The emphasis is on pre- 
senting the general principles and illustrating them with some relevant, 
examples rather than attempting to survey the whole field in a qualita- 
tive way. 


10.1 Principle of magnetic trapping 


In their famous experiment, Otto Stern and Walter Gerlach used the 
force on an atom as it passed through a strong inhomogeneous magnetic 
field to separate the spin states in a thermal atomic beam. Magnetic 
trapping uses exactly the same force, but for cold atoms the force pro- 
duced by a system of magnetic field coils bends the trajectories right 
around so that low-energy atoms remain within a small region close to 
the centre of the trap. Thus the principle of magnetic trapping of atoms 
has been known for many years but it only became widely used after the 
development of laser cooling.’ 
A magnetic dipole yz in a field has energy 


V=-p-B. (10.1) 


For an atom in the state |J JF Mp) this corresponds to a Zeeman energy 


Ve grLUBMrFB c (10.2) 


10.1 


The energy depends only the magnitude of the field B = |B]. The energy 
does not vary with the direction of B because as the dipole moves (adia- 
batically) it stays aligned with the field. From this we find the magnetic 
force along the z-direction: 


dB 

i gruBpMer Ae (10.3) 
Example 10.1 Estimate of the effect of the magnetic force on an atom 
of mass M that passes through a Stern Gerlach magnet at speed v. 
The atom takes a time t = L/v to pass through a region of high field 
gradient of length LZ (between the pole pieces of the magnet), where 
it receives an impulse Ft, in the transverse direction, that changes its 
momentum by Ap = Ft. An atom with momentum p = Mv along the 
beam has a deflection angle of 


g — AP _ BE El 
~~ p Mv? 2B ye! 


(10.4) 


The kinetic energy Exe ~ 2kpT (from Table 8.1), where T is the temper- 
ature of the oven from which the beam effuses. An atom with a single 
valence electron has a maximum moment of yp (when gr Mp = 1) and 
hence 


_ wp dB 3x 0.1 
cae a axa 


~1.4x10~*rad. (10.5) 


This evaluation for a field gradient of 3T m7! over L = 0.1m and T = 
373K makes use of the ratio of the Bohr magneton to the Boltzmann 
constant, given by 
FE —067KT-!. (10.6) 
kp 
Thus a well-collimated beam of spin-1/2 atoms that propagates for L = 
1m after the magnet will be split into two components separated by 
20L = 0.3mm. 

For an atom with T ~ 0.1K eqn 10.5 gives a deflection of 0.5 rad! 
Although the equation is not valid for this large angle, it does indicate 
that magnetic forces have a strong influence on laser-cooled atoms and 
can bend their trajectories around. From eqn 10.6 we see directly that 
a magnetic trap where the field varies from 0 to 0.03T has a depth of 
0.02K, e.g. a trap with a field gradient of 3Tm7~! over 10mm, as de- 
scribed in the next section. Remember that the Doppler cooling limit 
for sodium is 240K, so it is easy to capture atoms that have been 
laser cooled. Traps made with superconducting magnetic coils can have 
fields of over 10T and therefore have depths of several kelvin; this en- 
ables researchers to trap species such as molecules that cannot be laser 
cooled.” 
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Standard laser cooling does not work 
for molecules because repeated sponta- 
neous emission causes the population 
to spread out over many vibrational 
and rotational levels. Superconducting 
traps operate at low temperatures with 
liquid helium cooling, or a dilution re- 
frigerator, and molecules are cooled to 
the same temperature as the surround- 
ings by buffer gas cooling with a low 
pressure of helium (cf. Section 12.4). 
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(a) 


10.2 Magnetic trapping 


10.2.1 Confinement in the radial direction 


The estimate in the introduction shows that magnetic forces have a 
significant effect. on cold atoms and in this section we examine the specific 
configuration used to trap atoms shown in Fig. 10.1. The four parallel 
wires arranged at the corners of a square produce a quadrupole magnetic 
field when the currents in adjacent wires flow in opposite directions. 
Clearly this configuration does not produce a field gradient along the 
axis (z-direction); therefore from Maxwell’s relation div B = 0 we deduce 
ae dB, dB, 


=-—4=0'. 
daz dy 


These gradients have the same magnitude 6’, but opposite sign. There- 
fore the magnetic field has the form 


B = (28, — yey) + Bo. (10.7) 


Here we simply assume that 6’ = 3Tm7!; that this is a realistic field 
gradient can be shown by using the Biot—Savart law to calculate the field 
produced by coils carrying reasonable currents (see caption of Fig. 10.1). 
In the special case of Bo = 0, the field has a magnitude 


|B] = b' (a? + y?)1/? = d'r. (10.8) 


Thus the magnetic energy (eqn 10.2) has a linear dependence on the 
radial coordinate r = \/z? + y?. This conical potential has the V-shaped 
cross-section shown in Fig. 10.2(a), with a force in the radial direction 
of 

VV —grupMrb'e, . (10.9) 


Fig. 10.1 (a) A cross-section through four parallel straight wires, with currents into and out of the page as indicated. These 
give a magnetic quadrupole field. In a real magnetic trap, each ‘wire’ is generally made up of more than ten strands, each of 
which may conduct over 100 amps, so that the total current along each of the four wires exceeds 1000 amps. (b) The direction 
of the magnetic field around the wires—this configuration is a magnetic quadrupole. 


3V 


This force confines atoms in a low-field-seeking state, i.e. one with gre Mp 
> 0, so that the magnetic energy decreases as the atom moves into a 
lower field (see Fig. 6.10, for example). However, a quadrupole field has 
a fundamental problem—the atoms congregate near the centre where 
B = 0 and the Zeeman sub-levels (| JF Mp) states) have a very small 
energy separation. In this region of very low magnetic field the states 
with different magnetic quantum numbers mix together and atoms can 
transfer from one value of Mr to another (e.g. because of perturbations 
caused by noise or fluctuation in the field). These non-adiabatic tran- 
sitions allow the atoms to escape and reduce the lifetime of atoms in 
the trap. The behaviour of atoms in this magnetic trap with a leak at 
the bottom resembles that of a conical funnel filled with water—a large 
volume of fluid takes a considerable time to pass through a funnel with 
a small outlet at its apex, but clearly it is desirable to plug the leak at 
the bottom of the trap.® 

The loss by non-adiabatic transitions cannot be prevented by the addi- 
tion of a uniform field in the z- or y-directions, since this simply displaces 
the line where B = 0, to give the same situation as described above (at 
a different location). A field Bo = Boe, along the z-axis, however, has 
the desired effect and the magnitude of the field in eqn 10.7 becomes 


2.2 

1/2 be 
~B : 
0+ Bo 


This approximation works for small r where b’r < Bo. The bias field 
along z rounds the point of the conical potential, as illustrated in 
Fig. 10.2(b), so that near the axis the atoms of mass M see a harmonic 
potential. From eqn 10.2 we find 


|B] = {Bo + (b'r)"} 


(10.10) 


1 
V(r) =Vo+ a Murr (10.11) 
The radial oscillation has an angular frequency given by 
M 
i= aaa xb. (10.12) 


10.2.2 Confinement in the axial direction 


The Joffe trap, shown in Fig. 10.3, uses the combination of a linear 
magnetic quadrupole and an axial bias field described above to give 
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Fig. 10.2 (a) A cross-section through 
the magnetic potential (eqn 10.8) in a 
radial direction, e.g. along the x- or y- 
axis. The cusp at the bottom of the 
conical potential leads to non-adiabatic 
transitions of the trapped atoms. (b) A 
bias field along the z-direction rounds 
the bottom of the trap to give a har- 
monic potential near the axis (in the 
region where the radial field is smaller 
than the axial bias field). 


3These losses prevent the spherical 
quadrupole field configuration of the 
two coils in the MOT being used di- 
rectly as a magnetic trap—the MOT 
operates with gradients of 0.1 Tm! so 
thirty times more current-turns are re- 
quired in any case. We do not dis- 
cuss the addition of a time-varying field 
to this configuration that leads to the 
TOP trap used in the first experimental 
observation of Bose-Einstein condensa- 
tion in the dilute alkali vapours (Ander- 
son et al. 1995). 
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loffe coils 


ee 


Compensation coils 


Fig. 10.3 An Joffe -Pritchard magnetic trap. The fields produced by the various coils are explained in more detail in the text 
and the following figures. This loffe trap is loaded with atoms that have been laser cooled in the way shown in Fig. 10.5. 
(Figure courtesy of Dr Kai Dieckmann.) 


Fig. 10.4 The pinch coils have currents 
in the same direction and create a mag- 
netic field along the z-axis with a min- 
imum midway between them, at z = 0. 
This leads to a potential well for atoms 
in low-field-seeking states along this ax- 


ial direction. By symmetry, these co- ‘ 
axial coils with currents in the same di- 
rection give no gradient at z= 0. 


radial confinement for atoms in low-field-seeking states. To confine these 
atoms in the axial direction the trap has two pairs of co-axial coils with 
currents that flow in the same direction to produce a field along the z- 
axis whose magnitude is shown in Fig. 10.4. These so-called pinch coils* 
have a separation greater than that of Helmholtz coils, so the field along 
z has a minimum midway between the coils (where dB,/dz = 0). The 
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4The term ‘pinch coils’ arises from the 
concept of pinching off the ‘magnetic 
tube’ containing the atoms. The loffe 
trap configuration was originally devel- 
oped to confine plasma. 


Fig. 10.5 A general view of an appa- 
ratus to load an loffe-Pritchard mag- 
netic trap with laser-cooled atoms from 
a magneto-optical trap. (The MOT 
has a different arrangement of coils to 
that described in Section 9.4 but the 
same principle of operation.) This ex- 
perimental apparatus was constructed 
by the team of Professor Jook Wal- 
raven at the FOM Institute, Amster- 
dam. (Figure courtesy of Dr Kai Dieck- 
mann, Dieckmann et al. (1998).) Copy- 
right 1998 by the American Physical 
Society. 
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5In practice, the compensation coils 
need not have exactly the Helmholtz 
spacing; the current in these coils to- 
gether with pinch coil current gives 
two experimental parameters that al- 
low the adjustment of the magnitude 
and field curvature along z to any de- 
sired value (limited by the maximum 
current through the pairs of coils). Nei- 
ther pair of these coils gives a field gra- 
dient, by symmetry. 


6 Temperature is only defined at ther- 
mal equilibrium and in other situations 
the mean energy per atom should be 
used. 


Tan exponential distribution extends 
to infinity, and so for any value of Eeut 
(or n) there is always some probability 
of atoms having a higher energy; how- 
ever, the removal of a very small frac- 
tion of atoms has a little effect when av- 
eraged over the remaining atoms. Ex- 
ercise 10.4 compares different depths of 
cut. 


field has the form 
dB. 2" 
dz? 27 
This gives a corresponding minimum in the magnetic energy and hence 
a harmonic potential along the z-axis. Typically, the Ioffe trap has an 
axial oscillation frequency w, an order of magnitude lower than w, (= 
We = Wy), eg. W,/2m = 15 Hz and w,/2n = 200Mz (see Exercise 10:1) 
Thus the atoms congregate in a cigar-shaped cloud along the z-axis. The 
curvature of the magnetic field along z depends only on the dimensions 
of the pinch coils and their current. Therefore a uniform field along z 
does not affect w,, but it does change w, through the dependence on Bo 
in eqn 10.12. The pairs of compensation coils shown in Fig. 10.3 create 
a uniform field along the z-axis that opposes the field from the pinch 
coils. This allows experimenters to reduce Bo and make the trap stiff in 
the radial direction.° 

To load the approximately spherical cloud of atoms produced by op- 
tical molasses, the Ioffe trap is adjusted so that w, ~ w,. After loading, 
an increase in the radial trapping frequency, by reducing the bias field 
Bo (see eqn 10.12), squeezes the cloud into a long, thin cigar shape. This 
adiabatic compression gives a higher density and hence a faster collision 
rate for evaporative cooling. 


(10.13) 


Bpinch (z) = Bopinch (0) + 


10.3. Evaporative cooling 


Laser cooling by the optical molasses technique produces atoms with a 
temperature below the Doppler limit, but considerably above the recoil 
limit. These atoms can easily be confined in magnetic traps (as shown 
in Section 10.1) and evaporative cooling gives a very effective way of 
reducing the temperature further. In the same way that a cup of tea 
loses heat as the steam carries energy away, so the cloud of atoms in a 
magnetic trap cools when the hottest atoms are allowed to escape. Each 
atom that leaves the trap carries away more than the average amount of 
energy and so the remaining gas gets colder, as illustrated in Fig. 10.6. 
A simple model that is useful for understanding this process (and for 
quantitative calculations in Exercise 10.4) considers evaporation as a 
sequence of steps. At the start of a step the atoms have a Boltzmann 
distribution of energies (FE) = No exp(—E/kpT;) characteristic of a 
temperature 7. All atoms with energies greater than a certain value 
FE > Ecut are allowed to escape, where Fou; = nkpT, and typically the 
parameter 7 lies in the range 7 = 3-6; this truncated distribution has 
less energy per atom than before the cut so that, after collisions between 
the atoms have re-established thermal equilibrium, the new exponential 
distribution has a lower temperature T2 < T,.° The next step removes 
atoms with energies above nkpT>2 (a lower energy cut-off than in the first 
step) to give further cooling, and so on.’ For many small steps this model 
gives a reasonable approximation to real experiments where evaporation 
proceeds by a continuous ramping down of Ecut that cuts away atoms at 


(a). . 6) 


aga: Eeout 


#(E) #(E) (EB) 


the edge of the cloud (without stopping to allow rethermalisation). The 
rate of this evaporative cooling ramp depends on the rate of collisions 
between atoms in the trap.® 

During evaporation in a harmonic trap the density increases (or at 
least stays constant) because atoms sink lower in the potential as they 
get colder. This allows runaway evaporation that reduces the tempera- 
ture by many orders of magnitude, and increases the phase-space density 
to a value at which quantum statistics becomes important.? 

Evaporation could be carried out by turning down the strength of the 
trap, but this reduces the density and eventually makes the trap too weak 
to support the atoms against gravity. (Note, however, that this method 
has been used successfully for Rb and Cs atoms in dipole-force traps.) In 
magnetic traps, precisely-controlled evaporation is carried out by using 
radio-frequency radiation to drive transitions between the trapped and 
untrapped states, at a given distance from the trap centre, i.e. radiation 
at frequency Ww; drives the AMp = +1 transitions at a radius r that 
satisfies grupb’r = hw,¢. Hot atoms whose oscillations extend beyond 
this radius are removed, as shown in the following example. 
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Fig. 10.6 (a) A schematic representa- 
tion of atoms confined in a harmonic 
potential. (b) The height of the po- 
tential is reduced so that atoms with 
above-average energy escape; the re- 
maining atoms have a lower mean en- 
ergy than the initial distribution. The 
evolution of the energy distribution 
is shown below: (c) shows the ini- 
tial Boltzmann distribution f(£) = 
exp(—E/kpTi); (d) shows the trun- 
cated distribution soon after the cut, 
when the hot atoms have escaped; and 
(e) shows the situation some time later, 
after collisions between the remain- 
ing atoms have re-established a Boltz- 
mann distribution at a temperature T> 
less than Tj. In practice, evapora- 
tive cooling in magnetic traps differs 
from this simplified picture in two re- 
spects. Firstly, the potential does not 
change but atoms leave the trap by un- 
dergoing radio-frequency transitions to 
untrapped states at a certain distance 
from the trap centre (or equivalently 
at a certain height up the sides of the 
potential). Secondly, cooling is carried 
out continuously rather than as a series 
of discrete steps. 


Sif the process is carried out too rapidly 
then the situation becomes similar to 
that for a non-interacting gas (with no 
collisions) where cutting away the hot 
atoms does not produce any more low- 
energy atoms than there are initially. It 
just selects the coldest atoms from the 
others. 


°In contrast, for a square-well potential 
the density and collision rate decrease 
as atoms are lost, so that evaporation 
would grind to a halt. In the initial 
stages of evaporation in an Ioffe trap, 
the atoms spread up the sides of the 
potential and experience a linear po- 
tential in the radial direction. The lin- 
ear potential gives a greater increase in 
density for a given drop in temperature 
than a harmonic trap, and hence more 
favourable conditions to start evapora- 
tion. 
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10This is approximately equal to the 
Doppler cooling limit for sodium (but 
see Exercise 10.2(c)). 


11 Good images were obtained from 
2000 rubidium atoms in the first BEC 
experiment, and in principle it is possi- 
ble to detect even a single atom. 


12Contributions to the width of the 
radio-frequency transitions between 
Zeeman sub-levels arise from power 
broadening and fluctuations (noise) in 
the magnetic field. More usually, radio- 
frequency spectroscopy has a resolution 
limited by the interaction time, but this 
is less important for trapped atoms and 
continuous radiation. 


Example 10.2 For an atom with gy = 1/2 (as in sodium) in a trap 
with b' = 3Tm7!, the frequency varies with position as grypb'/h = 
21GHzm~! (wp = 14GHzT~'). The application of radio-frequency ra- 
diation at 40 MHz removes atoms over a surface whose cross-section 
in the plane z = 0 is a circle of radius r = 2mm. Sweeping the 
radio-frequency radiation down to 20 MHz reduces the radius to r = 
1mm. This estimate assumes that the cutting surface (where the radio- 
frequency radiation removes atoms from the cloud) lies in the region 
where the magnetic field is linear b’r >> Bo; this is the opposite of the 
condition that gives the harmonic approximation in eqn 10.10. For a 
bias field of Bp = 3 x 1074 T and the field gradient 6’ above, the trap- 
ping potential is linear for r >> 0.1mm, so our assumption of a linear 
field was valid. As evaporation proceeds, the atoms sink further down 
in the trap and the cross-over from a linear to a harmonic potential oc- 
curs when the cloud of atoms has a radius of r = Bo/b! = 0.1mm (see 
Fig. 10.2(b)). From eqn 10.6 we find that in a linear trap this would 
correspond to a cloud with a temperature of 2 x 10~* K.1° 


Evaporative cooling has no fundamental lower limit and temperatures 
below 10nK have been reached in magnetic traps. This is sufficient for 
the experiments discussed here, but let us consider briefly what limita- 
tions might arise in practice: (a) for a given set of starting conditions, it 
is not worthwhile to go beyond the point at which the number of trapped 
atoms becomes too low to detect;'! (b) when the energy resolution of 
the radio-frequency transition is similar to the energy of the remaining 
atoms it is no longer possible to selectively remove hot atoms whilst 
leaving the cold atoms—colloquially, this is referred to as the radio- 
frequency ‘knife’ being blunt so that it cannot shave off atoms from the 
edges of the cloud;!* and (c) in the case of fermions, it is difficult to 
cool atoms well below the Fermi temperature Jp at which quantum de- 
generacy occurs because, when almost all the states with energy below 
kpTp are filled (with the one atom in each state allowed by the Pauli 
exclusion principle), there is a very low probability of an atom going 
into an unoccupied state (‘hole’) in a collision. The case of bosons is 
discussed in the next section. 

The temperature of a cloud of trapped atoms can be reduced by an 
adiabatic expansion of the cloud, but, by definition, an adiabatic process 
does not change the phase-space density (or equivalently the average 
number of atoms in each energy level of the system). Thus the parameter 
of overriding importance in trapped systems is the phase-space density 
rather than the temperature. 


10.4 Bose—Einstein condensation 


Bosons are gregarious particles that like to be together in the same state. 
In contrast, fermions refuse to go into an already occupied state, e.g. 
electrons obey the Pauli exclusion principle (which governs the struc- 
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ture of atoms). Statistical mechanics tells us that when a system of 
bosons reaches a critical phase-space density it undergoes a phase tran- 
sition and the particles avalanche into the ground state. The standard 
textbook treatment of this Bose-Einstein condensation (BEC) applies 
almost exactly to dilute vapours of alkali metals in magnetic traps, and 
the relative simplicity of these systems was a strong motivation for these 
experiments. In comparison, superfluid helium is more complex since in 
this liquid the helium atoms interact much more strongly than the atoms 
in a dilute vapour.!? Appendix F outlines a mathematical treatment of 
BEC that starts from the statistical mechanics of a ‘gas’ of photons in 
thermodynamic equilibrium, i.e. black-body radiation; this treatment 
shows clearly that BEC is a completely different sort of phase transi- 
tion from the ‘ordinary’ condensation of a vapour into liquid caused by 
attractive forces between the atoms, or molecules. Quantum statistics 
becomes important when the occupation of quantum states approaches 
unity. At lower phase-space densities the particles hardly ever try to go 
into the same states, so they behave as classical objects; but when the 
states start to get crowded the particles behave differently, in a way that 
depends on their spin. 

As shown in Appendix F, quantum effects arise when the number 
density n = N/V reaches the value!* 


6 

= —— 10.14 

where Agp is the value of the thermal de Broglie wavelength defined by 
h 

AaB = =... 10.15 

8 Vin ket le 


This definition corresponds to the usual expression Agp = h/Mv with a 
speed v characteristic of the gas. Simply speaking, the de Broglie wave- 
length gives a measure of the delocalisation of the atoms, i.e. the size of 
the region in which the atom would probably be found in a measurement 
of its position. This uncertainty in position increases as the momentum 
and associated kinetic energy decrease. Quantum effects become impor- 
tant when Agp becomes equal to the spacing between the atoms, so that 
the individual particles can no longer be distinguished.'® 

For an ideal Bose gas at the density of liquid helium (145kgm~° at 
atmospheric pressure) eqns 10.14 and 10.15 predict a critical tempera- 
ture of 3.1 K; this is close to the so-called A-point at 2.2K, where helium 
starts to become superfluid (see Annett 2004). The equations derived 
for a gas give quite accurate predictions because, although helium liqui- 
fies at 4.2K, it has a lower density than other liquids (cf. 10° kg m7? for 
water). Helium atoms have weak interactions because of their atomic 
structure—the closed shell of electrons leads to a small size and very low 
polarizability. The detailed properties of superfluid helium are, however, 
far from those of a weakly-interacting Bose-condensed gas. In contrast, 
trapped atomic gases have much lower densities so BEC occurs at tem- 
peratures of around one microkelvin. It is quite amazing that sodium 


3 


13 Historically, the idea of BEC arose af- 
ter the Indian physicist Satyendra Bose 
published a paper in 1924 that derived 
the Planck distribution for radiation in 
a new way, by looking at a statistical 
distribution of photons over the energy 
levels. Einstein realised that the same 
approach could be applied to particles 
(that we now call bosons), and he pre- 
dicted the occurrence of Bose-Einstein 
condensation. Einstein wrote,‘the the- 
ory is pretty but there may be some 
truth to it’, in a letter to Paul Ehren- 
fest. 


14The symbol n is used for number 
density (as in most statistical mechan- 
ics texts) rather than N as in previous 
chapters. The number of atoms is V 
and V is the volume. 


15-This very general criterion also ap- 
plies to fermions but not with the same 
numerical coefficient as in eqn 10.14. 
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161 these systems the Bose-condensed 
gas is metastable; however, conditions 
can be reached where the condensate 
has a lifetime of many minutes. The 
ultra-cold molecules that form when 
cold atoms recombine are interesting 
to study in their own right. To 
achieve long lifetimes, limited by the 
recombination of atoms, the conden- 
sate must be held in an ultra-high vac- 
uum to reduce the rate of collisions with 
molecules of the background gas in the 
apparatus. 


17 Neglecting a factor of 101/3 ~ 2. 


18This number at Tc has been cho- 
sen because after further evaporation it 
would lead to a condensate of roughly 
10° atoms. 


190 nce a few bosons have accumulated 
in a particular state others want to 
join them. This cooperative behaviour 
arises because the constructive interfer- 
ence for bosons leads to a rate of stim- 
ulated transitions into a level propor- 
tional to the number in that level. 


20This number of atoms gives a Tc 
close to the recoil limit for sodium (and 
To varies only slowly with NV’). Hence 
the atoms have a de Broglie wavelength 
comparable with that of laser cooling 
light. This coincidence gives a conve- 
nient way of remembering the approx- 
imate values. Note that To depends 
on the species of atom, since in a trap 
with a given magnetic field, limited by 
the maximum current through the coils, 
the oscillation frequencies of atoms are 
inversely proportional to /M. 


and other alkalis can exist as atomic vapours at such low temperatures, 
and indeed this was doubted by many people before it was achieved in 
experiments. This is possible at very low densities because the processes 
that lead to recombination into molecules, and also heat the sample, 
occur slowly compared to the formation of a Bose-Einstein condensate."® 


10.5 Bose—Einstein condensation in 
trapped atomic vapours 


The usual textbook discussions of Bose-Einstein condensation consider 
a gas with a uniform density throughout a box of fixed volume, i.e. a 
homogeneous gas. However, the experiments with magnetically-trapped 
atoms correspond to a Bose gas in a harmonic potential and here we look 
at this inhomogeneous system. This section shows how to derive rough 
values of the important quantities in a way that gives a good physical 
understanding of the properties of the trapped Bose gas. A cloud of 
thermal atoms (i.e. not Bose-condensed) in a harmonic potential with a 
mean oscillation frequency w has a radius r given by 


1 ih 

5 Muir ~ shel. (10.16) 
To the level of accuracy required we take the volume of the cloud as 
V ~ 4r? (a reasonable approximation to the volume of a sphere 47r? /3). 
This gives the number density as n ~ N//4r°, which, when combined 
with eqn 10.14, gives 


AOle s Ss kpc 

AdB hw 

This result comes from substituting for r from eqn 10.16 and for Agp 

at the critical temperature, To, from eqn 10.15." Fortuitously, despite 

all our approximations, this expression lies within 10% of the value de- 

rived more carefully for a Gaussian distribution of atoms (see books by 

Pethick and Smith (2001) and Pitaevskii and Stringari (2003)). When 

the trapping potential does not have spherical symmetry, this result can 
be adapted by using the geometrical mean 


(10.17) 


o= (coxa : (10.18) 
For a cloud of NV = 4 x 10° atoms we find!8 
kpTco = haN/? = ho x 160. (10.19) 


This result shows clearly that at the BEC transition the atoms occupy 
many levels of the trap and that it is quantum statistics which causes 
atoms to avalanche into the ground state.’ A typical trap with 0/27 = 
100 Hz has a level spacing of hi/kg = 5nK (in temperature units). In 
this trap To ~ 1K.?° For 4 x 10° atoms in this trap, eqn 10.14 gives 
as the density at Tc 


no = 40pm”? = 4x 10° cm *. 
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The quantum statistics of identical particles applies to composite parti- 
cles in the same way as for elementary particles, so long as the internal 
degrees of freedom are not excited. This condition is well satisfied for 
cold atoms since the energy required to excite the atomic electrons is 
much greater than the interaction energy.! 


10.5.1 The scattering length 


The quantum theory of scattering is well described in most quantum 
mechanics texts, and the brief summary given here is only intended as 
a reminder of the salient points that are relevant for understanding the 
collisions between ultra-cold atoms in a gas. An important feature of 
very low-energy collisions is that, although the potential of the attractive 
interaction between two atoms has the shape shown in Fig. 10.7, the 
overall effect is the same as a hard-sphere potential. Thus we can model 
a low-temperature cloud of atoms as a gas of hard spheres,”? in particular 
for the calculation of the contribution to the energy of the gas from 
interactions between the atoms. 

This section gives a justification for this behaviour of ultra-cold atoms 
by simple physical arguments without mathematical details. The molec- 
ular potential (as in Fig. 10.7) has bound states that correspond to a 
diatomic molecule formed by the two atoms, and this part of the molec- 
ular wavefunction is a standing wave analogous to those that lead to 
the quantised energy levels of electrons in atoms.?? It is the unbound 
states, however, that are appropriate for describing collisions between 
atoms in a gas and these correspond to travelling-wave solutions of the 
Schrédinger equation (illustrated in Figs 10.8 and 10.9). In the quan- 
tum mechanical treatment we need to solve the Schrodinger equation to 
determine what happens to an atomic wavepacket in the potential. The 
angular momentum of a particle in a radial potential V(r) is a conserved 


Tmin 


21D. Ehrenfest and J. R. Oppenheimer 
proved the spin-statistics relation for 
indistinguishable composite particles in 
‘Note on the statistics of nuclei’, Phys. 
Rev., 37, 333 (1931). 


22 At least in most of the cases of inter- 
est for ultra-cold atoms. 


23-This part of the molecular wave- 
function represents the vibrational mo- 
tion of the molecule. Other aspects 
of molecular physics are described in 
Atkins (1994). 


Fig. 10.7 The potential V(r) for the 
interaction between two neutral atoms 
as a function of their separation r. 
At small separations the shells of elec- 
trons around each atom overlap and the 
strong electrostatic repulsion keeps the 
atoms apart, while at larger separations 
attractive van der Waals interactions 
dominate. These forces balance each 
other at the separation, where the po- 
tential is a minimum—this corresponds 
to the equilibrium separation of the di- 
atomic molecule that is the bound state 
of the two atoms, e.g. the two sodium 
atoms are 0.3nm apart in Nag. This in- 
teratomic potential is called the molec- 
ular potential and it determines other 
properties of the molecule, e.g. its vi- 
brational frequency (corresponding to 
oscillations in the potential well). 
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Incident 
plane 
wave 


Fig. 10.8 (a) A pair of colliding atoms with relative velocity v in their centre-of-mass frame. The impact parameter Timpact 
determines their relative orbital angular momentum (which is conserved). (b) In the quantum mechanical description of a 
low-energy scattering the solution of the Schrodinger equation is the sum of an incident plane wave eikz plus the wave scattered 
by the potential that expands outwards from r = 0, i.e. a wavefunction of the form ~ « e'** + f;, (0) e*” /r. This is an eigenstate 
with energy E = h?k?/M (for elastic scattering, i.e. no loss of energy). Only the scattering amplitude f, (0) depends on the 
potential V (r). For low energies the scattering amplitude is a constant, f « Yo,o, and the condition in eqn 10.20 is fulfilled so 
that the phases kz and kr have a negligible variation across the region of interaction; hence 7 ~ 14+ f/r. Writing the scattering 
amplitude as f = —a so that #(r) = 0 on a spherical surface of radius r = a, shows that the scattered wave is a spherical wave 
that is equivalent to the scattering from a hard sphere of this radius. The comparison with a hard sphere is useful for positive 


=H 


Scattered spherical wave 


values of a, but scattering theory allows negative values a < 0 for which the outgoing wave is also spherical. 


Fig. 10.9 The solution of the 
Schrodinger equation for low-energy 
scattering from a molecular potential 
(cf. Fig. 10.7). The plot shows P(r), 
where R(r) = P(r)/r is the radial 
wavefunction. At long range the 
overall effect of any potential (with 
a finite range) on the scattered wave 
is a phase shift (relative to a wave 
scattered from a point-like object at 
r = 0, shown as a dotted line)—in 
this region P(r) = sin(kr—@) and 
the scattering is indistinguishable from 
that of a hard-sphere potential that 
gives the same phase shift. Potentials 
are characterised by the radius a of the 
equivalent hard-sphere potential. (Ra- 
dial wavefunctions can be calculated 
by the numerical method outlined 
in Exercise 4.10, and the extension 
of such a computational approach to 
study quantum scattering is described 
in Greenhow (1990). From Butcher et 
al, (1999). 


V(r) 


Wavefunction without potential 


Wavefunction with potential 
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quantity, in both classical and quantum mechanics; thus the eigenstates 
of the angular part of the Schrédinger equation are the spherical har- 
monic functions, as in the central-field approximation for atoms. We 
can deduce the orbital angular momentum by correspondence with the 
following classical calculation. 

A pair of colliding atoms has relative orbital angular momentum 
fil ~ M'vrimpact, Where M' is the reduced mass,”* v is their relative 
velocity and Trimpact is the impact parameter (defined in Fig. 10.8). For 
a collision to happen rimpact must be less than the range of the inter- 
action rint. Thus we find that Al < M'vrint = hrint/Aap, using the de 
Broglie relation. This implies that | < 27rjnt/Aap and therefore, when 
the energy is sufficiently low that 


A 
ae > Tint ? 


= (10.20) 


we have | = 0, i.e. the atoms have no relative orbital angular momen- 
tum. In this regime, the scattered wavefunction is a spherical wave 
proportional to Yizo,m=o0 no matter how complicated the actual poten- 
tial. At the recoil limit of laser cooling the atoms and the photons 
of the laser light have a comparable wavelength Aqp ~ Atignht, because 
they have similar momentum, e.g. sodium atoms at Trecoiy = 2 uK have 
Aap /27 ~ 100nm.”° This estimate indicates that the condition 10.20 is 
fulfilled at temperatures of a few microkelvin since the range of the inter- 
action between neutral atoms is normally considerably less than 100 nm 
(equivalent to 2000 Bohr radii).2° This spherical wave corresponding to 
the eigenfunction Yo, is called the s-wave, where s denotes zero relative 
orbital angular momentum (cf. s-orbitals that are bound states with 
a 0).-° 

The discussion of the s-wave scattering regime justifies the first part of 
the statement above that low-energy scattering from any potential looks 
the same as scattering from a hard-sphere potential when the radius of 
the sphere is chosen to give the same strength of scattering. The radius 
of this hard sphere is equivalent to a parameter that is usually called the 
scattering length a. This single parameter characterises the low-energy 
scattering from a particular potential.28 For example, sodium atoms in 
the |F = 1, Mp = 1) state have a = 2.9nm, which is about an order of 
magnitude greater than the size of the atom’s electronic charge cloud 
and does not correspond to any physical feature in the real atom. In 
the following, the energy contribution from interactions between the 
atoms in a low-temperature gas is calculated assuming that the atoms 
act like hard spheres (which is just a useful fiction that is mathematically 
equivalent to the scattering from the actual potential). First, let us study 
a simple example that illustrates features that arise in the general case. 


Example 10.3 A particle in a spherical well 

The Schrodinger equation for a particle in a spherically-symmetric po- 
tential can be separated into an angular equation and a radial equation 
that can be written in terms of P(r) = rR(r), as in eqn 2.16. In this 


24The difference between the reduced 
mass and the mass of the individual 
atoms is not particularly important in 
this rough estimate. 


25 Calculated in the laboratory frame 
of reference, i.e. not using the reduced 
mass. 


26 Molecular potentials, such as that 
shown in Fig. 10.7, do not have a 
sharply-defined cut-off. The determi- 
nation of the minimum distance at 
which the atoms can pass with a neg- 
ligible effect on each other requires a 
more general treatment. 


27 Tn the particular case of two identi- 
cal bosons in the same internal state, 
the spatial wavefunction must be sym- 
metric with respect to an interchange of 
the particle labels. Such wavefunctions 
have even orbital angular momentum 
quantum numbers / = 0, 2, 4, etc. Thus 
p-wave scattering cannot occur for col- 
lisions between identical bosons and the 
s-wave scattering regime extends up to 
the threshold energy for d-waves. 


28 There are many potentials that can 
give the same value of a. 
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29 This is the usual result for an infinite 
square well of length LE rewritten with 
L=b-a. 


30The reduced mass is used for the 
same reason as in the hydrogen atom, 
i.e. in one-electron atoms the electron 
and nucleus orbit around their centre- 
of-mass frame, and therefore it is the 
reduced mass of the electron me xX 
Mn/(me + Mn) that appears in the 
Schrodinger equation for the atom—the 
(slight) dependence of the electron’s re- 
duced mass on the nuclear mass My 
leads to the isotope shift of spectral 
lines (Chapter 6). 


example we shall examine the properties of wavefunctions with | = 0 
(corresponding to s-waves), so that the equation for P (r) is simply 


lise 


ome gen + Y Pir) =e er) ; 


(1020) 
where M’ is the mass of the particle. For the potential V (r) = 0 within 
the range a < r < b and V(r) = o elsewhere (for r < a and r > ), 
this equation has the same form as that for an infinite square well in one 
dimension. The solution that satisfies the boundary condition w(r) = 0 
abt = ais 

P=Csin(k(r—-a)), (10.22) 
where C is an arbitrary constant. The boundary condition that the 


wavefunction is zero at r = b requires that k (b — a) = nz, where n is an 
integer; hence the energy eigenvalues are given by”? 


h2 ke? h2n2n? 
~ 2M! 2M'(b-a) 
To make a link with scattering theory we consider what happens when 
a < b, so that the wavefunction is contained in a spherical region of 


radius b but excluded from a small hard sphere of radius a at the origin; 
the energy of the lowest level (n = 1) can be written as 


(10.23) 


E (10.24) 


Tes ( i_2 
~ 2M'b? b 
This equals the energy for a = 0 plus a small perturbation proportional 
to a which arises because the kinetic energy depends on the size of the 


region between r = a and b. (The expectation value of the potential 


Cet re 


energy is zero.) At short range where sin(k(r—a)) ~ k(r—a) the 
solution in eqn 10.22 reduces to 
iy 
R(r) ~ = x1— - (10.25) 


This is the general form for a wavefunction with a low energy (ka < 1) 
near a hard sphere (in the region a < r < Aqgp/27) and the features 
illustrated by this example arise in the general case. The use of this 
situation to illustrate scattering might seem to contradict the above 
assertion that scattering involves wavefunctions that are unbound states, 
but we will see below that such waves have a similar increase in energy 
proportional to a. In any case, we will be applying the results to atoms 
that are confined (in the harmonic potential created by a magnetic trap). 


A collision between a pair of atoms is described in their centre-of- 
mass frame as the scattering from a potential V (r) of a particle with a 
reduced mass given by®? 

; M, M2 


Mi = ee 6 
M, + Mp (10.26) 


10.5 Bose-Einstein condensation in trapped atomic vapours 


In a gas of identical particles, the two colliding atoms have the same 
mass M, = Mz = M and therefore their reduced mass is M’ = M/2.°*! 

The discussion above showed that, in the s-wave regime, the scattering 
is the same for a hard-sphere potential and for the actual molecular 
potential (see also Fig. 10.9). Using the wavefunction in eqn 10.25, with 
an amplitude x, we find that the expectation value of the kinetic energy 
operator (—h?/2M')V?, with M' = M/2, is given by*? 


c= fff Er ob-ye 


Sarita elle 
Ania, 3 
aera (10.27) 


Taking the upper limit of integration over r as infinity gives a reasonable 
estimate of the energy (Pathra 1971).** This increase in energy caused 
by the interaction between atoms has the same scaling with a as in 
eqn 10.24, and arises from the same physical origin. We shall use this 
result to account for the interatomic interactions in a Bose-Einstein 
condensate in the following section. 

Finally, we note a subtle modification of scattering theory for identical 
particles. The usual formula for the collision cross-section is 47a”, but 
identical bosons have 


o = 8na’. (10.28) 


The additional factor of 2 arises because bosons constructively interfere 
with each other in a way that enhances the scattering.*4 This and the 
other features of collisions between ultra-cold bosons relevant to Bose- 
Einstein condensation are explained more fully in the books by Metcalf 
and van der Straten (1999), Pethick and Smith (2001), and Pitaevskii 
and Stringari (2003). These references also give a more careful definition 
of the scattering length that shows why this parameter can be positive 
a > 0, or negative a < 0 (see Fig. 10.10). However, the majority of 
experimental work on Bose-Einstein condensation has been carried out 
with states of sodium and rubidium atoms that have positive scattering 
lengths, corresponding to the effectively repulsive hard-sphere interac- 
tions considered in this section. A full treatment of quantum scattering 
also allows a more rigorous derivation of the temperature below which 
there is only s-wave scattering, but the exact limits of the s-wave regime 
are not important for the purposes of this chapter since collisions be- 
tween atoms at temperatures of a few microkelvin are normally well 
within this regime.*° 


233 


31The transformation to the centre-of- 
mass frame and the use of reduced mass 
is very similar in classical and quantum 
mechanics. 


32-The integrand |Vo|? = Vy" Vw is 
obtained from ~*V2% by integration 
by parts (as in the standard derivation 
of probability current in quantum me- 
chanics). 


33 Most of the contribution to the inte- 
gral comes from the region where a < 
rT < Agp/27, where eqn 10.25 is a good 
approximation to the wavefunction. 


2* The probability of bosons going into 
a particular quantum state is enhanced 
by a factor N + 1, where N is the 
number of particles in that state—for 
a two-body collision this increases the 
probability by a factor of 2. In con- 
trast, two identical fermions cannot oc- 
cupy the same state and therefore s- 
wave collisions, in which the particles 
have the same spatial state, are forbid- 
den for fermions in the same spin state. 
(As noted previously, p-wave collisions 
do not occur for bosons.) 


35 Exceptions might occur where a res- 
onance gives especially strong interac- 
tions between the atoms. 
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Fig. 10.10 The wavefunction P(r) = 
rR(r) for very-low-energy scattering 
from slightly different molecular poten- 
tials with scattering lengths that are 
(a) positive and (b) negative with very 
large magnitude. From Butcher et al. 
(1999). 


36 Actually, there are NM — 1 other 
atoms, but the difference from N is neg- 
ligible for large numbers of atoms. 


37 This quantity turns out to be equiv- 
alent to the chemical potential in ther- 
modynamics: pp = OEF/ON is the en- 
ergy required to remove a particle from 
the system. This is not the same as the 
average energy per particle—see Exer- 
cise 10.7. 


(a) V(r) 


10.6 A Bose-Einstein condensate 


The interaction between atoms is taken into account by including a term 
in the Schrédinger equation that comes from eqn 10.27, proportional to 
the square of the wavefunction: 


h2 
{Fv +V(e) ol bw =p. (10.29) 


The extra energy from the interactions is proportional to a|?, the prob- 
ability of finding a particle in a given region, and the coupling constant 
is 
_ 4nh?Na 
ei 


This comes from eqn 10.27 with |x|? + A || that gives the interaction 
per atom in the presence of V atoms.*° The symbol ju is used (instead 
of E) to represent the energy of an individual atom in the presence of 
all the others (cf. the central-field approximation in Chapter 4).°’ This 
nonlinear Schrodinger equation is called the Gross—Pitaevskii equation 
after the people who first (independently) wrote it down and a rigorous 


(10.30) 
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derivation can be found in the book by Pitaevskii and Stringari (2003). 
Trapped atoms experience a harmonic potential 
1 
VG@)= 5M (wea? + wily? + wz"). (10.31) 
For simplicity, we shall consider all three oscillation frequencies equal, 
i.e. the isotropic potential Mw?r?/2, and use a variational method to 


estimate the energies. We choose a trial wavefunction that is a Gaussian 
function: 


p = AeW™ /20” | (10.32) 
Substitution into the Schrédinger equation gives the energy as 
3 a b? quel 
B = Ghw | a h+—tae. 10.33 
4 be ho (27)3/? be ( ) 


Differentiation of this expression shows that when g = 0 the minimum 
energy occurs when } = apo, where 


h 


ae 
is the characteristic radius of the Gaussian ground-state wavefunction in 
the quantum harmonic oscillator. A sodium atom in a trap with oscilla- 
tion frequency w/2m = 100 Hz has ayo = 2x 10-6 m. For this equilibrium 
value of b, the two terms representing the kinetic and potential energies 
give an equal contribution to the total energy which is E = (3/2) hw, as 
expected for the ground state of the quantum harmonic oscillator.** The 
variational method gives exact results in this particular case because the 
trial wavefunction has the same Gaussian form as the actual solution for 
a harmonic oscillator. 

Now we shall consider what happens when g > 0.°° The ratio of the 


(10.34) 


Gaho = 


terms representing the atomic interactions and the kinetic energy is*° 
4 N. 
a (10.35) 
3 (27) / Qj, hw Gho 


The nonlinear term swamps the kinetic energy when NV > ayo/a. This 
ratio equals ano/a = 800 for ano = 2m and a = 2.6nm, and so in 
this case when the number of atoms in the condensate VV’ > 800 we 
can neglect the kinetic energy.*! We could estimate the condensate’s 
size from eqn 10.33 by determining the value of r at which the confining 
potential balances the repulsive interactions, and this variational method 
is described in Exercise 10.6; but it is remarkably easy to solve the Gross— 
Pitaevskii equation when the kinetic energy term is neglected. In this 
so-called Thomas—Fermi regime eqn 10.29 becomes simply 


{V (r) +9|b) bw = py. (10.36) 
So for the region where w # 0 we find 
hee ©. (10.37) 


g 


38 Bach of the three degrees of freedom 
has a zero-point energy of 5 hw. 


39small Bose-Einstein condensates 
with effectively attractive interactions 
have been created, but they collapse 
inward as the number of atoms grows 
(see Exercise 10.11). 


40-The factor of (4/3)(4m/(27)3/2) that 
arises has a numerical value of about 
unity. 


41 Typically, experiments have N > 105 
and the effectively repulsive interac- 
tions make the condensate much larger 
than aj,.The interactions between the 
atoms in a dilute gas only have a small 
effect (< 10%) on the value of Tc. 
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Hence, the number density of atoms n(r) = N |w (r)|° in the harmonic 
potential has the form of an inverted parabola: 


a oy? 2 
where no, the number density at the centre of the condensate, is 
ee le (10.39) 
g 


The condensate has an ellipsoidal shape and the density goes to zero at 
points on the axes given by x = +R,, y= +R, and z = +R,, defined 
by 


= Mu? 2 = pL, (10.40) 


and similarly for R, and R,. In the Thomas—Fermi regime, the atoms 
fill up the trap to the level of the chemical potential as illustrated in 
Fig. 10.11, just like water in a trough. The chemical potential p is 
determined by the normalisation condition 


i= // iwi? @rdyd: = = on ee (10.41) 
g 15 
A useful form for p is 
2/5 
Po (= *) (10.42) 
2 Qho 


The mean oscillation frequency is defined as @ = CE 3 and Gye 
is calculated using this frequency in eqn 10.34. Typical values for an 
loffe trap are given in the following table together with the important 
properties of a BEC of sodium, calculated from the key formulae in 
eqns 10.42, 10.40 and 10.39 (in that order). 


Scattering length (for Na) a 2.9nm 
Radial oscillation frequency (wz = w,) We / 2m 250 Hz 
Axial oscillation frequency w, [20 16 Hz 
Average oscillation frequency w/a 100 Hz 
Zero-point energy (in temperature units) $hw/kp 2.4nK 
Harmonic oscillator length Gho 2 um 
Number of atoms in condensate No TO: 
Chemical potential p 130 nk 
Radial size of condensate R, = R, 15 wm 
Axial size of condensate Re 95 ym 
Peak density of condensate no 2x 10cm: 
Critical temperature (for 4 x 10° atoms) To 760 nk 


Critical density (for 4 x 10° atoms) neo Ax 10°* cme: 
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(a) 


(b) 


Fig. 10.11 In the Thomas—Fermi 
regime the condensate has the same 
shape as the confining potential. (a) 
A harmonic potential. (b) The density 
of the atoms in a harmonic trap has 
1 Ss an inverted-parabolic shape (along all 
three axes). 


I 


The critical temperature and density at the onset of Bose condensation 
are not properties of the condensate, but are calculated for a cloud 
of 4 x 10° atoms (from eqn 10.19); this would lead to a condensate 
of roughly No ~ 10° atoms after evaporative cooling to T/To ~ 0.5 
(where most atoms are in the condensate, see eqn F.16). Both yu and 
Tc have a weak dependence on N (eqns 10.19 and 10.42) and a similar 
(but not the same) dependence on w, so these quantities have a similar 
relative magnitude for many cases. Note, however, that depends on 


(a) (b) (c) 


Fig. 10.12 This sequence of images shows a Bose-Einstein condensate being born out of a cloud of evaporatively-cooled atoms 
in a magnetic trap. Each image was taken after a time-of-flight expansion. The cloud changes its size and shape as it undergoes 
a phase transition: (a) a thermal cloud just above the critical temperature To has a spherical shape (isotropic expansion); (b) a 
cloud of atoms at 0.97 > has a Bose-condensed fraction in the centre surrounded by a halo of thermal atoms; and (c) well below 
the critical temperature (< 0.57c) most of the atoms are in the condensate (lowest energy state of the trap). These images 
come from a system that does not have the same aspect ratio as an Ioffe trap, but they illustrate the anisotropic expansion 
of the condensate wavefunction. See Fig. 10.13 for dimensions and other details. Data provided by Nathan Smith, Physics 
department, University of Oxford. 
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42-The repulsive interactions prevent 
the much greater increase in the density 
that would occur if atoms congregated 
in a region of volume aj. 


Fig. 10.13 Cross-sections of the images 
similar to those shown in Fig. 10.12, but 
for different temperatures and a time 
of 12ms after release from the mag- 
netic trap. (a) Just below the critical 
temperature (0.997¢) a small central 
peak appears on the Gaussian distribu- 
tion of thermal atoms. (b) At 0.82T> 
most of the atoms are in the condensate 
with some thermal atoms in the wings. 
(c) At 0.637 ¢ only a small thermal 
cloud remains. This narrowing of the 
distribution and change from a Gaus- 
sian to an inverted-parabolic shape oc- 
curs over a small range of temperatures 
which is a behaviour characteristic of a 
phase transition. From Hechenblaikner 
(2002), for a trap with we = wy = 
Bae 3 WAG aks Biel we, == Wap Sh BEG Ala: 
The fraction of atoms in the conden- 
sate (No/N) differs from that predicted 
by eqn F.16 because of interactions (see 
Marago et al. 2001). 


the strength of the interactions, whereas Tc does not. In this example 
the condensate has a density about a factor of 5 greater than the thermal 
cloud at the phase transition,*” but the gas remains dilute because the 
average distance between atoms in the condensate is larger than the 
scattering length, that is na? < 1 (for the data in the table noa® = 
4 x 10-°). Equation 10.40, and the similar equation for R,, give the 
ratio of sizes as R,/R, = wz/wz = 16 (and R, = Fz), so in this trap 
the condensate has the shape of a long, thin cigar. 


(a) rf = 1.87 MHz 
12 No/N = 0.06 


(b) 


Optical depth 


(c) 


rf = 1.76 MHz 
No/N = 0.58 


0 100 200 300 400 500 
Distance (um) 
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To observe the condensate experimenters record an image by illumi- 
nating the atoms with laser light at the resonance frequency.*? Typically, 
the experiments have an optical resolution of about 5 um, so that the 
length of the condensate can be measured directly but its width is not 
precisely determined. Therefore the magnetic trap is turned off sharply 
so that the atoms expand and some time later a laser beam, that passes 
through the cloud of atoms onto a camera, is flashed on to record a 
shadow image of the cloud. The repulsion between atoms causes the 
cloud to expand rapidly after the confining potential is switched off (see 
Exercise 10.6). The cigar-shaped cloud expands more rapidly in the ra- 
dial direction (a and y) than along z, so that after several milliseconds 
the radial size becomes bigger than that along z, i.e. the aspect ratio 
inverts.** In contrast, the uncondensed atoms behave as a classical gas 
and expand isotropically to give a spherical cloud, since by definition the 
thermal equilibrium implies the same kinetic energy in each direction. 
Pictures such as Fig. 10.12 are the projection of the density distribution 
onto a two-dimensional plane, and show an obvious difference in shape 
between the elliptical condensate and the circular image of the thermal 
atoms. This characteristic shape was one of the key pieces of evidence 
for BEC in the first experiment, and it is still commonly used as a di- 
agnostic in such experiments. Figure 10.13 shows the density profile of 
the cloud of atoms released from a magnetic trap for temperatures close 
to the critical point, and below. 


10.7 Properties of Bose-condensed gases 


Two striking features of Bose-condensed systems are superfluidity and 
coherence. Both relate to the microscopic description of the condensate 
as NV atoms sharing the same wavefunction, and for Bose-condensed 
gases they can be described relatively simply from first principles (as 
in this section). In contrast, the phenomena that occur in superfluid 
helium are more complex and the theory of quantum fluids is outside 
the scope of this book. 


10.7.1 Speed of sound 


To estimate the speed of sound v, by a simple dimensional argument we 
assume that it depends on the three parameters pz, M and w, so that*° 
Us x p* MP7. (10.43) 


This dimensional analysis gives*® 


Us & a (10.44) 


This corresponds to the actual result for a homogeneous gas (without us 
needing to insert any numerical factor), and gives a fairly good approx- 
imation in a trapped sample. The speed at which compression waves 


43 Generally, absorption gives a better 
signal than fluorescence but the optical 
system and camera are similar in both 
cases. 


44 This expansion of the wavefunction is 
predicted by including time dependence 
in the nonlinear Schrédinger equation. 


45 The size of the condensate R is not 
another independent parameter, see 
eqn 10.40. 


46 Comparing the dimensions of the 
terms in eqn 10.43 gives 


ins > = kem*a “|? ke? a 77, 


Hence a = —G = 1/2 and y=0. 
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travel in the gas has great significance for superfluidity. For motion 
slower than this speed the condensate flows smoothly around obstacles 
without exciting any particles out of the ground state of the quantum 
gases. This type of flow does not dissipate any energy and so it is fric- 
tionless and the gas is superfluid. 


10.7.2 Healing length 


The Thomas—Fermi approximation neglects the kinetic energy term in 
the Schrédinger equation. This leads to a physically unrealistic sharp 
edge at the surface of the condensate (see Fig. 10.11)—such a discon- 
tinuity in the gradient would make Vw infinite. Therefore we have 
to take kinetic energy into account at the boundary. To determine the 
shortest distance € over which the wavefunction can change we equate 
the kinetic term (that contains V7 ~ h?/(2M€E?)) to the energy scale 
of the system given by the chemical potential. Atoms with energy higher 
than p leave the condensate. Using no = Ny/g (from eqn 10.39) and 
eqn 10.30 for g, we find that 
2 2 

aie. = 0 (10.45) 

2M é N AL 
Hence € = 1/,/87ano, e.g. € = 0.3m for a sodium condensate with 
no = 2x 10'4cm~?. Typically, € < R; and smoothing of the wavefunc- 
tion only occurs in a thin boundary layer, and these surface effects give 
only small corrections to results calculated using the Thomas-Fermi ap- 
proximation. This so-called healing length also determines the size of the 
vortices that form in a superfluid when the confining potential rotates 
(or a fast moving object passes through it). In these little ‘whirlpools’ 
the wavefunction goes to zero at the centre, and € determines the dis- 
tance over which the density rises back up to the value in the bulk of 
the condensate, i.e. this healing length is the distance over which the 
superfluid recovers from a sharp change. 


10.7.3. The coherence of a Bose—Einstein 
condensate 


Figure 10.14 shows the result of a remarkable experiment carried out by 
the group led by Wolfgang Ketterle at MIT. They created two separate 
condensates of sodium at the same time. After the trapping potential 
was turned off the repulsion between the atoms caused the two clouds to 
expand and overlap with each other (as in the time-of-flight technique 
used to observe Bose-Einstein condensation, see Fig. 10.12). The two 
condensates interfere to give the fringes shown in the figure; there are 
no atoms at certain positions where the matter waves from the two 
sources interfere destructively—these atoms do not disappear, but they 
are redistributed to positions in the fringe pattern where the matter 
waves add constructively. Such interference is well known in optics; 
however, there is a very interesting difference between this experiment 
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and the usual double-slit experiments. In the MIT experiment there was 
no fixed relation between the phases of the two condensates and before 
the experiment was carried out it was hotly debated whether interference 
would be observed. Clear interference fringes were observed each time 
the experiment was carried out. However, the position of these fringes 
depended on the difference between the phases of the condensates in 
that particular run—the bright and dark fringes appeared at a different 
place each time the experiment was repeated; thus the fringe pattern 
would ‘wash out’ if averaged over many runs. The observation of the 
interference of two condensates relies on the ability to see interference 
fringes in a single shot. 

The experiment was carried out with an Ioffe trap in which the atoms 
form a long, cigar-shaped cloud. At MIT they used a sheet of light 
to chop the cloud into two pieces of roughly half the original length. 
The light exerted a force that pushed the atoms out of the region of 
high intensity (because it had a blue frequency detuning as explained in 
Section 9.6). This configuration gave two separate potential wells. In 
practice, the following situations both give the same results: (a) when 
the two condensates are created independently; and (b) when a single 
large condensate is divided into two parts after it has formed. The pro- 
cess of turning on a sheet of light in the middle of an already-formed 
condensate produces such a strong perturbation that the two resulting 
condensates have almost random phases. Only recently has the con- 
trolled separation of a condensate into two parts whilst preserving the 
phase been demonstrated in a double-well dipole-force trap—a system 
that corresponds to a beam splitter for matter waves. (Atom optics 
is discussed further in Chapter 11.) However, the intriguing aspect of 
the interference of two independent condensates is its dissimilarity to 
previous experiments. 


Fig. 10.14 The interference fringes ob- 
served when two independent Bose con- 
densates are released from nearby po- 
tential wells and the clouds of atoms 
expand and overlap. This experiment 
was carried out with sodium atoms by 
the team led by Wolfgang Ketterle at 
MIT (Andrews et al. 1997). Copyright 
1997 by the American Association for 
the Advancement of Science. 
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Fig. 10.15 Atoms coupled out of a 
Bose condensate fall downwards un- 
der gravity to form a well-collimated 
matter-wave beam, with analogous 
properties to the beam of light from 
a laser. Courtesy of Nathan Smith 
and William Heathcote, Physics de- 
partment, University of Oxford. 


47 The atoms accelerate as they fall un- 
der gravity so the wave propagation is 
different to that of light. 


LOO jum 


10.7.4 The atom laser 


The phrase ‘atom laser’ has been used to describe the coherent beam of 
matter waves coupled out of a Bose-Einstein condensate (as shown in 
Fig. 10.15). After forming the condensate, the radio-frequency radiation 
was tuned to a frequency that drives a transition to an untrapped state 
(e.g. Mr = 0) for atoms at a position inside the condensate. (This comes 
from the same source of radiation used for evaporative cooling.) These 
atoms fall downwards under gravity to form the beam seen in the figure. 
These matter waves coupled out of the condensate have a well-defined 
phase and wavelength like the light from a laser.4” Many novel matter- 
wave experiments have been made possible by Bose-Einstein condensa- 
tion, e.g. the observation of nonlinear processes analogous to nonlinear 
optics experiments that were made possible by the high-intensity light 
produced by lasers. 


10.8 Conclusions 


Bose-Einstein condensation in dilute alkali vapours was first observed in 
1995 by groups at JILA (in Boulder, Colorado) and at MIT, using laser 
cooling, magnetic trapping and evaporation. This breakthrough, and 
the many subsequent new experiments that it made possible, led to the 
award of the Nobel prize to Eric Cornell, Carl Wieman and Wolfgang 
Ketterle in 2001 (and the Nobel prize web site has much useful informa- 
tion on this subject, with links to the web sites of the research groups). 
Recent BEC experiments have produced a wealth of beautiful images; 
however, the objective of this chapter has not been to cover everything 
but rather to explain the general principles of the underlying physics. 
The two books on BEC by Pethick and Smith (2001) and Pitaevskii and 
Stringari (2003) contain much more detail. 
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Exercises 


(10.1) Magnetic trapping 
An loffe-Pritchard trap has a radial gradient of 
b' = 3Tm?, and the combination of Helmholtz 
and pinch coils gives a field along z with Bo = 
3x 1074 T and curvature 6” = 300T m~?. Calcu- 
late the oscillation frequencies of sodium atoms 
in the trap. 


(10.2) Loading a trap 


(a) 


(b) 


A spherical cloud of 10°° sodium atoms with 
a density of around 10'°cm~* and a tem- 
perature of T = 2.4 x 10-*K is placed in 
a spherically-symmetric trapping potential. 
The temperature and density of the cloud are 
preserved during this loading if 

Mur? = shot. 
Calculate the trapping frequency w that ful- 
fils this mode-matching condition, and ex- 
plain what happens if the trap is too stiff or 
too weak. (In a precise treatment r would be 
the root-mean-square radius of a cloud with 
a Gaussian density distribution. ) 


(10.46) 


Calculate n\3p /2.6 for the trapped cloud, i.e. 
the ratio of its phase-space density to that re- 
quired for BEC (eqn 10.14). 


After loading, an adiabatic compression of 
the trapped cloud changes the oscillation fre- 
quencies of the atoms to w,/2x = 250Hz 
and w,/27 = 16 Hz. The phase-space density 
does not change during adiabatic processes, 
i.e. NASR is constant. Show that this implies 
that TV*/? is constant. Calculate the tem- 
perature and density of the cloud after com- 
pression.*® 


(10.3) Magnetic trapping 


(a) Sketch the energy of the hyperfine levels 


(b 


(d 


(e 


(g 


) 


) 


) 


2 


of the 3s 7S, /2 ground level of sodium as 
a function of the applied magnetic field 
strength. (The hyperfine-structure constant 
of this level is A3; = 886 MHz and sodium 
has nuclear spin J = 3/2.) 


What is meant by a ‘weak’ field in the con- 
text of hyperfine structure? 


Show that for a weak magnetic field the 
states in both hyperfine levels have a split- 
ting of 7GHzT*. 


Explain why the potential energy of an atom 
in a magnetic trap is proportional to the 
magnetic flux density |B]. 

A magnetic trap has a field that can be ap- 
proximated by 


B=0' (ré,z — yé,) 


in the region where r = (2? + y?)!/? < 
10mm, and B = 0 outside this radius. The 
field gradient 6’ = 1.5 Tm’ and the z-axis 
of the trap is horizontal. 


Calculate the ratio of the magnetic force on 
the atoms compared to that of gravity. 


Estimate the maximum temperature of 
atoms that can be trapped in the (i) upper 
and (ii) lower hyperfine levels. State the Mp 
quantum number of the atoms in each case. 
(Assume that the confinement of atoms along 
the z-axis is not the limiting factor.) 


For the clouds of trapped atoms in both (i) 
and (ii) of part (f), describe the effect of ap- 
plying radio-frequency radiation with a fre- 
quency of 70 MHz. 


48'The relation between temperature and volume can also be derived from thermodynamics: TV7~! is constant for an adia- 
batic change in an ideal gas and a monatomic gas has a ratio of heat capacities y = Cp/Cy = 5/2. Actually, the phase-space 
density only remains constant if the potential has the same shape throughout the adiabatic change. In the case of an Ioffe trap, 
the radial potential may change from harmonic to linear (see Example 10.2), giving a small increase in the phase-space density. 
This effect arises because the population of the energy levels stays the same but the distribution of the levels changes—the 
energy levels of a harmonic potential are equally spaced (hw apart), whereas in a linear potential the intervals between levels 
decrease with increasing energy. 
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(10.4) 


(10.5) 


(10.6) 


Evaporative cooling 

A cloud of atoms has a Boltzmann energy distri- 
bution NV (E) = Ae~®”, where 1/8 = kpT and 
the normalisation constant A is found from 


Neotal = af ene dE = mM 3 
0 B 


The cloud has a total energy given by 


ax A 
otal af Ee °? dE = Re NtotaksT . 
0 B 
Hence each atom has a mean energy E = kpT. In 
an evaporative cooling step all atoms with energy 
greater than € escape. 


(a) Calculate the fraction of atoms lost 


AN /Notal- 
Calculate the fractional change in the mean 
energy per atom. 


(b) 


Evaluate your expressions for cuts with Be = 
3 and 6. Compare the ratio of energy lost 
and the number of atoms removed in the two 
cases and comment on the implications for 
evaporative cooling. 


(c) 


The collision rate between atoms in the cloud 
is Reon = nvo. Assuming that the collision 
cross-section o is independent of the energy, 
show that Reon « Mtotal/Ftotai in a harmonic 
trapping potential. Show that the collision 
rate increases during evaporation in such a 
potential. 


The properties at the phase transition 

A cloud of 10° rubidium atoms is confined in 
a harmonic trap with oscillation frequencies of 
w,/2n7 = 16Hz and w,/2x = 250 Hz (and axial 
symmetry). Calculate the critical temperature 
Tc and estimate the density of the cloud at the 
phase transition. 


Properties of a Bose condensate 

The properties of a Bose condensate were calcu- 
lated in the text using the Thomas—Fermi approx- 
imation, which gives accurate results for large 
condensates. This exercise shows that minimis- 
ing the energy in eqn 10.33 (a variational calcula- 
tion) to find the equilibrium size leads to similar 
results. 

In a spherically-symmetric trapping potential, 
rubidium atoms (M = 87a.m.u.) have an os- 
cillation frequency of w/27 = 100 Hz and hence 


(10.7) 


G@ho = 1pm. The atoms have a scattering length 
of a = 5nm. Calculate the following for a con- 
densate with Mo = 10° atoms. 


(a) Show that the repulsive interactions give a 
much greater contribution to the total energy 
than the kinetic term. 

Use eqn 10.33 to find an expression for the 
equilibrium size r and evaluate it. 


What is the density of the condensate? 


(b) 


(c) 
(d) Show that the contribution to the energy 
from the repulsive interactions represents 


two-fifths of the total. 


Find an expression for the energy E (in terms 
of fw). (Note that this expression should 
have the same dependence on the various pa- 
rameters as in eqn 10.42, but with a different 
numerical factor.) Evaluate E/ks. 


(e) 


(f) When the trapping potential is switched off 
suddenly the potential energy goes to zero 
and the repulsive interaction between the 
atoms causes the condensate to expand. Af- 
ter a few milliseconds almost all this energy 
(from the repulsive interactions) is converted 
into kinetic energy. Estimate the velocity 
at which the atoms fly outwards and the 
size of the condensate 30 ms after the trap 
is switched off.*® 


Comment. These estimates of the important 
physical parameters show that, although inter- 
actions have little influence on the phase transi- 
tion (Bose-Einstein condensation occurs because 
of quantum statistics and is completely different 
to the ‘ordinary’ condensation of a vapour into 
a liquid caused by molecular interactions, e.g. 
steam into water), the properties of the conden- 
sate itself do depend on the interactions between 
atoms, e.g. the energy of the condensate is much 
larger than the zero-point energy of the ground 
state of the quantum harmonic oscillator. 


The chemical potential and mean energy per 
particle 


(a) Show that eqn 10.42 follows from the preced- 
ing equations in Section 10.6. 


49The small size of Bose condensates makes them difficult. to view directly, although this has been done in certain experiments. 
Generally, the condensate is released from the trap and allowed to expand before an image (e.g. Fig. 10.12). 


(10.8) 


(10.9) 


(10.10) 


(b) In thermodynamics the chemical potential is 
the energy required to remove a particle from 
the system pp = OE/ON, where E is the total 
energy of the system. Show that FE = 2Ny. 


Expansion of a non-interacting condensate 
Although experiments are not carried out with 
a non-interacting gas it is instructive to consider 
what happens when a = 0. In this case the con- 
densate has the same size as the ground state 
of a quantum harmonic oscillator (for any No) 
and the initial momentum along each direction 
can be estimated from the uncertainty principle. 
For atoms of the same mass as sodium (but with 
a = 0) released from a trap with a radial os- 
cillation frequency of 250 Hz and a frequency of 
16 Hz for axial motion, estimate roughly the time 
of flight at which the cloud is spherical. 


Excitations of a Bose condensate 

The vibrational modes of a condensate can be 
viewed as compression waves that form a stand- 
ing wave within the condensate; hence these 
modes have frequencies of the order of the speed 
of sound divided by the size of the condensate 
vs/R. Show that this collective motion of the 
condensate occurs at a comparable frequency to 
the oscillation of individual atoms in the mag- 
netic trap. 


Derivation of the speed of sound 
The time-dependent Schrédinger equation for the 
wavefunction of an atom in a Bose-Einstein con- 
densate in a uniform potential is 
nde 
dt 
where, for simplicity, the potential has been taken 
as zero (V = 0). The wavefunction y = woe #*/" 
satisfies this equation with a chemical potential 


h? 2 2 


(a 


= g\wo 


The trial wavefunction with small fluctuations 
can be written as 


y= [yo + yeilke—wt) teegme*)| ea int/h 
= toe" + dy (t), 
where the amplitudes |u| and |v| are small com- 
pared to |ywol. 


(a) Show that substituting this function into the 
Schrédinger equation and making suitable 
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approximations leads to the same zeroth- 
order approximation for the chemical poten- 
tial given above and 


ore Cal 
ind (ov (0) 
Bs h?k? 


= ap fv + aldol? 25¢ (t) + gbodu" (t). 


(10.47) 


(b) Show that equating terms with the same time 
dependence leads to two coupled equations 
for u and v that, in matrix form, are 


6 + 2g |Wo|? — gb ) @ 
95)? ex + 2g |Wol’ — pw) \v 

U 
=na("). 


where €, = hk? /2M. 


(c) Hence show that u and v are solutions of the 
matrix equation 


(ese he) = 
90)” ek tp thw) \oJ 
From the determinant of this matrix, find 
the relation between the angular frequency of 
the small oscillations w and the magnitude of 
their wavevector k (the dispersion relation). 
Show that for low energies this gives the same 
expression for the speed of sound w/k found 

in Section 10.7.1.°° 


(10.11) Attractive interactions 


In certain hyperfine states, the scattering length 
a of alkali metal atoms changes with the applied 
magnetic field and this feature has been used to 
perform experiments in which the atoms have at- 
tractive interactions a < 0. 

Show that eqn 10.33 can be written in the form 


2E 

re 
By plotting graphs for various values of the pa- 
rameter G between 0 and —1, estimate the lowest 
value of G for which there exists a minimum in 
the energy as a function of z. For an atomic 
species with a scattering length of a = —5nm 
in a trap where @no = 2m, estimate the max- 
imum number of atoms that a Bose condensate 
can contain without collapsing. 


ce oe Ge: 


50 After problem devised by Keith Burnett, Physics graduate class, University of Oxford. 
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Further reading 


Exercises 


‘Tt might seem advantageous to use 
laser-cooled atoms with larger Agp, but 


we shall see that this is not true. 


Atom interferometry 


The possibility of interferometry with atoms follows directly from wave— 
particle duality. The wave-like propagation of particles, such as atoms, 
means that they undergo interference and diffraction in an analogous 
way to light. This chapter explains how such matter waves have been 
used in interferometers that measure rotation and gravitational acceler- 
ation to a precision comparable with the best optical instruments. As 
in many important developments in physics, atom interferometry relies 
on simple principles—the first part of this chapter uses only elementary 
optics and the de Broglie relation 


han oe ai 
p 


for the wavelength of the matter wave associated with a particle of mo- 
mentum p = Mv. This relation between wavelength and momentum 
also applies to light waves and the momentum of photons, but here we 
shall use Agp exclusively for matter waves and X, without a subscript, 
for the wavelength of light. 

A sodium atom with velocity v = 1000ms~! (typical of a thermal 
beam) has \gp = 2 x 10~'' m—about 1/30 000 times the wavelength of 
visible light, and comparable to that of X-ray radiation. Gratings with 
lines sufficiently close together to diffract such short wavelengths can 
now be made by nano-fabrication, i.e. techniques for making structures 
on scales smaller than 1 ym.’ Neutrons also have short de Broglie wave- 
lengths but, unlike atoms, they pass through crystals and diffract from 
the closely-spaced planes of atoms. Electrons also diffract from crystals 
and their wave properties were observed long ago by Davisson and Ger- 
mer in their classic experimental confirmation of wave—particle duality 
in quantum physics. This previous work with neutrons and electrons is 
mentioned here simply to show that matter-wave interferometry has a 
long history, and both neutron and electron diffraction are now highly- 
developed techniques used in condensed matter physics (Blundell 2001). 
The more recent matter-wave experiments with atoms described in this 
chapter should not be regarded as tests of already well-known quantum 
behaviour; rather their importance lies in their ability to make more 
precise measurements than other techniques in certain applications. 

Consideration of the familiar Young’s double-slit experiment provides 
a good introduction to the basic ideas of matter-wave experiments with 
atoms and gives a feeling for the size of the physical parameters. We 
shall then extend the treatment to a diffraction grating (multiple slits) 
and the design of an interferometer that measures the rate of rotation by 


iL tl 


the Sagnac effect. These matter-wave experiments operate on the same 
principles as similar experiments with light because the atoms remain in 
the ground state and propagate as simple waves. It will be assumed that 
the reader understands the standard treatment of Fraunhofer diffraction 
in optics (Brooker 2003) and key results will be quoted here rather than 
derived. After the discussion of the work with nano-fabricated slits and 
gratings, we shall look at the use of laser light to manipulate the atom’s 
momentum using methods closely related to laser cooling. These laser 
techniques make use of the atom’s internal energy levels—something 
that is not possible for electrons or neutrons. 


11.1 Young’s double-slit experiment 


Young originally carried out his double-slit experiment to test the wave 
nature of light and his simple arrangement still finds practical use in 
measurements of the coherence of light.? Figure 11.1 shows a typical 
experimental layout. Waves propagate from the source slit S through 
the two slits, ©; and Ye, to a point P in the detection plane.* The 
amplitude of the light at any point on the detection plane equals the 
sum of the electric field amplitudes that arrive at that point via slits 
%, and Sg. In any interference or diffraction calculation the resultant 
amplitude at the final point is determined by summing the contributions 
from all possible paths taking account of the phase. For the double slits 
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The double slits also form the basis 
of many theoretical discussions of fun- 

amental issues in quantum mechanics, 
such as why we cannot know which slit 
the photon went through and still ob- 
serve interference. 


3 Young’s fringes are not localised on 
this plane but can be seen throughout 
the far-field region. 


-— Detector 
rt ae 


Fig. 11.1 (a) The apparatus for observing interference from double slits. The light diffracted by the source slit. propagates 
through the two slits ©; and D2 and onto the plane P. The interference fringes can be seen with the eye (with the aid of 
a magnifying eyepiece if necessary), but to further the analogy with atom optics experiments the apparatus is drawn with a 
detector such as a photodiode or photomultiplier. A narrow slit in front of the detector gives good spatial resolution; this slit 
and the detector scan across the fringes, as indicated. The light comes from a lamp, or laser; in a matter-wave experiment an 
atomic oven creates a beam of atoms collimated by the source slit (as shown in Fig. 11.2). (b) The difference in the distance 
from © to P and from X2 to P is d sin@, where d is the slit separation. The angle @ and the distance X in the detection plane 
are related by X = Ltan@. The transverse distances are drawn greatly exaggerated for clarity; for the typical conditions given 


in the text the fringes have an angular separation of 2 x 107“ rad. 
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5 Blementary treatments often assume 
that the slits are equidistant from the 
source (Al = 0) so that the phase of the 
wave is the same at each slit (equivalent 
to having a plane wavefront before the 
slits). This assumption simplifies the 
algebra but it is not a necessary con- 
dition for Fraunhofer diffraction with 
light, or with matter waves. (A more 
detailed description of the assumptions 
such as L >> d can be found in text- 
books on optics, e.g. Brooker (2003).) 


we define J, as the distance from S to P via 5, and similarly [2 for the 
path through slit £2, as shown by the dotted lines in Fig. 11.1(b).* Slits 
of the same size contribute equally to the total amplitude at a point on 
the plane P: 

Ep « Ey Gas ct as) . (11.2) 


The intensity is proportional to the square of this amplitude, I « |E|?, 


so that : 
I = Ip cos” (S) 


Here ¢ = 27(€2 — £;)/X is the phase difference between the two arms 
and Jp is the maximum intensity. Bright fringes occur at positions in 
the detection plane where the contributions from the two paths inter- 
fere constructively; these correspond to ¢ = n2z, with n an integer, or 
equivalently 


(11.3) 


bo — ty, =A. (11.4) 


To find the spacing of the fringes in the plane of observation we define 
the coordinate X measured perpendicular to the long axis of the slits 
in the plane P. In terms of the small angle defined in Fig. 11.1(b) this 
becomes 


X = Ltané. (115) 


A similar small angle approximation allows us to express the path length 
difference in Fig. 11.1(b) as 
fg —€, = Al+dsiné@. (11.6) 
Here Al = SY — S¥p is the path difference before the slits.> The path 
length difference from the two slits of separation d to P is dsin@. The 
last three equations give the spacing of the fringes as 
LX 


AX =—. 


: (11.7) 


For an experiment with visible light of wavelength \ = 6 x 10-7 m, slits 
of separation d = 3 x 10-*m and L = 1m, the fringes have a spacing 
of AX = 2mm and can be clearly seen by eye. The treatment so far 
assumes a small source slit at S that acts like a point source to illuminate 
the double slits coherently. The condition for this is that the double slits 
fall within the angular spread of the light diffracted from the source slit 
(Brooker 2003). The diffraction from a slit of width wg has an angular 
spread Ogig¢ ~ A/ws. Therefore coherent illumination of two slits at a 
distance L’ from this source slit requires L'Ogi¢ > d, or 


AL! 
ra 
For L' = 0.1m and the values of \ and d used previously, we find 


ws < 2x 10-4m. Such slits are made by standard techniques and we 
know that Young’s experiment with light is relatively straightforward to 


ws < (11.8) 
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carry out in the laboratory. Experiments with short-wavelength matter 
waves require the smallest available structures with slits on the scale of 
100 nm. 

A double-slit experiment was carried out with a beam of helium atoms 
in the metastable 1s2s 7S, level that lies 20eV above the ground state 
(Carnal and Mlynek 1991). This matter-wave experiment had the same 
layout as that shown in Fig. 11.1, although with atoms the interfer- 
ometer has to be set up inside a vacuum chamber. Exercise 11.2 goes 
through the application of the equations given in this section to the cal- 
culation of the slit widths required to observe interference fringes with 
He*. Metastable helium is very suitable for this experiment, firstly be- 
cause it has a fairly long Agg and secondly when metastable atoms hit 
a surface they release sufficient energy to eject electrons; counting these 
charged particles allows the arrival of individual atoms to be detected 
with high efficiency. Further discussion of double-slit experiments can be 
found in the quantum mechanics book by Rae (1992); he uses a neutron 
interference experiment as an example of wave—particle duality. 


11.2 A diffraction grating for atoms 


Figure 11.2 shows an apparatus with a highly-collimated atom beam 
of sodium incident upon a transmission grating. The experimenters 
used a remarkable grating with slits only 50nm wide, spaced 100nm 
apart—equal widths of the bars and the gaps between them. Etching 
these very thin bars and their delicate support structure represents the 
state of the art in nano-fabrication. Figure 11.2(b) shows the diffraction 
pattern obtained with a mixture of sodium atoms and molecules, and 
Fig. 11.2(c) shows the diffraction of a beam of sodium molecules. The 
diffraction peaks of Naz have about half the spacing of those for the Na 
atoms, as expected from the de Broglie relation for particles of twice the 
mass (for similar velocities). 
Researchers recently exploited this property of these special gratings 
to make the first experimental observation of the very weakly-bound 
state of two helium atoms (Schdélikopf and Toennies 1994). Other meth- 
ods of detection dissociate the very tenuously-bound Hez molecule. The 
gratings work well with helium and atomic beams of inert gases, since 
they do not clog the slits in the same way as sodium. However, despite 
the practical difficulties of working with these very fragile structures, 
a recent experiment used a grating to diffract Cg9 molecules—so-called 
Buckyballs (Arndt et al. 1999, Nairz et al. 2003). This demonstration 
of the wave-like nature of such massive particles prompts the following 
question: ‘What is the largest object for which such quantum inter- 
ference can be observed?’® This question relates to Schrédinger’s well- ©This means quantum effects in the 
known example of a cat that may be in a superposition of two states tion, or external degrees of freedom, 
(‘alive’ and ‘dead’). To observe interference an object must exist in oer Ole eae 
a superposition of the two states: |1) in which it goes through slit ¥4 ; 
and |2) the state where it goes through slit 42. Nothing within standard 
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Fig. 11.2 (a) Diffraction of a collimated atomic beam by a grating. To observe the diffraction of matter waves from the grating 
the source slit must be sufficiently narrow to make the matter waves coherent across several of the slits in the grating. This 
is the same requirement as described in the previous section for Young’s double slits, and Exercise 11.1 looks at the relation 
of the pattern with multiple slits to that observed with just two slits. For the grating there is an additional requirement that 
the angular spread of the incident beam must be less than the angle between the diffracted orders, otherwise they cannot be 
distinguished. In this apparatus the slit widths were about 20m, the slits in the nano-fabricated grating had a spacing of 
100 nm and all the distances Lc, L’ and L were about 1m. (b) The diffraction of a collimated beam of sodium atoms and 
molecules by the grating. (c) The diffraction pattern for a beam that contains only Naz molecules (this pattern is also shown 
as a dotted curve in (b)). The peaks for the molecules have half the spacing of those for the atoms as expected for twice the 
mass—the atoms and molecules have almost the same velocity in the supersonic flow because they are both carried along in a 
stream of krypton gas that flows through the heated oven containing sodium metal. This carrier gas gives a supersonic beam 
with a much lower velocity spread Au/v ~ 0.03 than would be obtained from an effusive source of thermal atoms. The sodium 
atoms were removed from the beam by the resonant radiation pressure from a laser beam (not shown) perpendicular to the 
supersonic beam—scattering three or more photons was sufficient to deflect atoms out of the beam. From Chapman et al. 
(1995). Copyright 1995 by the American Physical Society. 


quantum mechanics tells us that we cannot put something as large as 
a cat into a quantum superposition, if it is completely isolated from 
external perturbations. However, the heaviest object that can be used in 
a practical double-slit experiment in the foreseeable future is far lighter 
than a cat, but considerably heavier than what has been achieved so far. 
Continued work on matter-wave interference of larger and larger objects 
will be of great interest since it probes the boundary between quantum 
and classical physics. 


11.3. The three-grating interferometer 


Figure 11.3 shows an arrangement of three diffraction gratings a distance 
L apart. A highly-collimated beam of sodium atoms propagates through 
this three-grating interferometer onto a detector for atoms.’ Diffraction 
at the first grating G1 splits the beam only the zeroth- and first-order 
diffraction orders (0 and +1 orders) have been drawn for simplicity. The 
second grating G2 gives diffraction through the same angles as G1, so 
that some of the paths meet up at the plane of the third grating G3, 
e.g. the 0 and +1 orders from G1 are both diffracted by G2 to form the 
parallelogram ABPC, as shown in Fig. 11.3(a). The detector records the 
flux of atoms along one of the possible output directions coming from 
P. This arrangement closely resembles a Mach-Zehnder interferometer 
for light, with a smaller angle between the two arms because of the 
achievable grating spacing. For two-beam interference the signal has 
the same form as eqn 11.3. In these interferometers the sum of the 
fluxes of the atoms, or light, in the two possible output directions equals 
a constant, i.e. when a certain phase difference between the arms of the 
interferometer gives destructive interference at the detector then the flux 
in the other output direction has a maximum. 


11.4 Measurement of rotation 


The Mach-Zehnder interferometer for matter waves shown in Fig. 11.3 
measures rotation precisely, as explained in this section.® To calculate 
the phase shift caused by rotation in a simple way we represent the 
interferometer as a circular loop of radius R, as in Fig. 11.4. The wave 
travelling at speed v from the point S takes a time t = 7R/v to propagate 
around either arm of the interferometer to the point P diametrically 
opposite S. During this time the system rotates through an angle Mt, 
where 2 is the angular frequency of rotation about an axis perpendicular 
to the plane of the interferometer. Thus the wave going one way round 
the loop has to travel Al = 20. Rt further than the wave in the other arm 
of the interferometer. This corresponds to Al/Aap extra wavelengths, 
or a phase shift of 

Ao as x 20x lh : 


2 11.9 
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’The detector for sodium has a hot 
wire, heated by a current flowing 
through it, that runs parallel to the 
slits. The sodium atoms ionize when 
they hit the hot surface of the wire and 
the ejected electrons create a measur- 
able current. 


8 For light a different configuration 
called a Sagnac interferometer is gen- 
erally used to measure rotation but the 
principles are similar. 
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Fig. 11.3 (a) An interferometer formed by three diffraction gratings spaced by a distance L along the atomic beam. A 
collimated beam of atoms is produced, as shown in Fig. 11.2. Waves diffracted at the first grating G1 split again at G2, so 
that some of the paths meet at G3. Only the 0 and +1 diffraction orders are shown and to further simplify the diagram some 
of the possible paths between G2 and G3 have not been drawn completely. Contributions to the amplitude at P arrive from 
A, via either B or C. The detector must be sufficiently far from G3 that it picks up only one of two possible output directions. 
(The parallelogram ABPC is just one of many closed loops formed by the three gratings; some others are indicated by dotted 
lines. With the detector at the position shown, the three gratings act as a Mach -Zehnder interferometer, as shown in (b). The 
diffraction gratings behave both as beam splitters and as deflectors (mirrors) for the matter waves (for small angles). (c) A 
Mach-Zehnder interferometer for light—the optical system equivalent to the three-grating interferometer. The incident wave 
hits beam splitter BS1 and the reflected and transmitted amplitudes reflect off mirrors M1 and M2, respectively, so that their 
paths meet again at BS2. Interference between the two paths leads to a detected intensity Ip = $1o{1 + cos(@ + Ad)} (cf. 
eqn 11.3). The phase ¢ that arises from path length differences and phase shifts on reflection at the mirrors is assumed to be 
fixed and A¢ represents the extra phase that is measured; e.g. for an interferometer that rotates at angular frequency 2 about 
an axis perpendicular to the plane of the instrument Ad « 2, so the instrument measures rotation, as shown in Section 11.4. 


The loop has area A = 7R?, so that 


Ar 
Ad = 
‘g \dBU 


x A. (11.10) 


A more rigorous derivation, by integration around a closed path, shows 
that this equation applies for an arbitrary shape, e.g. the square interfer- 
ometer of Fig. 11.3(c). Comparison of this phase shift for matter waves 
of velocity v with that for light Agdipnt, for an interferometer of the same 


11.5 


area A, shows that 


2 

Ag= cs x Adright = we x Adiignt - 
The ratio equals the rest mass of the atom divided by the energy of each 
photon and has a value of Ad¢/Agyignt ~ 101° for sodium atoms and 
visible light. This huge ratio suggests that matter-wave interferometers 
have a great advantage, but at the present time they only achieve com- 
parable results to conventional interferometers with light. Conventional 
interferometers with light make up the ground by: 


(11.11) 


(a) having much larger areas, i.e. a distance between the arms of metres 
instead of a fraction of a millimetre achieved for matter waves; 


(b) the light goes around the loop many times;? 


(c) lasers give a much higher flux than the flux of atoms in a typical 
atomic beam. For example, in the scheme shown in Fig. 11.2 only 
a small fraction of the atoms emitted from the source end up in 
the highly-collimated atomic beam; as a source of matter waves the 
atomic oven is analogous to an incandescent tungsten light bulb 
rather than a laser.!° 


11.5 The diffraction of atoms by light 


A standing wave of light diffracts matter waves, as illustrated in Fig. 11.5. 
This corresponds to a role reversal as compared to optics in which mat- 
ter, in the form of a conventional grating, diffracts light. This section 
explains how this light field created simply by the retro-reflection of 
a laser beam from a mirror is used in atom optics. The interaction 
of atoms with a standing wave leads to a periodic modulation of the 
atomic energy levels by an amount proportional to the intensity of the 
light, as explained in Section 9.6.'! The light shift of the atomic energy 
levels in the standing wave introduces a phase modulation of the matter 
waves. An atomic wavepacket ~(a,z,t) becomes 7(z, z,t) e'4?) im- 
mediately after passing through the standing wave; it is assumed that 
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Fig. 11.4 (a) A simplified diagram 
of an interferometer where the waves 
propagate from S to P. (b) Rotation at 
angular frequency 2 about an axis per- 
pendicular to the plane of the interfer- 
ometer makes one path Qt longer and 
the other shorter by the same amount, 
where t is the time taken for a wave to 
travel from S to P. This leads to the 
phase shift in eqn 11.10. 


9 Laser gyros use high-reflectivity mir- 
rors or optical fibres. 


10The atom interferometer based on 
Raman transitions (described in Sec- 
tion 11.5.1) does not require such a 
highly-collimated atomic beam, and 
that technique has achieved precise 
measurements of rotation. 


11The laser has a frequency w sufh- 
ciently far from the atom’s transition 
frequency wo that spontaneous emis- 
sion has a negligible effect, yet close 
enough for the atoms to have a signifi- 
cant interaction with the light. 
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Fig. 11.5 The diffraction of atoms by 
a standing wave light field. The an- 
gle of the first order of diffraction 6 
is related to the grating period d by 
dsin@ = Agp. For the standing wave 
d = /2, so sin@ = 2Agp/A. This dif- 
fraction can be regarded as a scattering 
process where an atom of momentum p 
receives an impulse that gives it trans- 
verse momentum Ap and deflects it by 
an angle @ given by tan9 = Ap/p. 


12This is obvious from a treatment 
of diffraction by Fourier transforms 
(Brooker 2003). 
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w(x, z,t) changes smoothly over a length scale much greater than 4/2. 
This phase modulation has a spatial period of 4/2, where A is the wave- 
length of the light not the matter waves. The matter waves accumulate 
an additional phase A¢(x) = ¢o cos?(27a2/A) from the light shift. This 
phase grating diffracts the matter waves at angles determined by 


dsin@ = nrAgp , (11.129 


where d = 4/2, n is an integer and Agp gives the matter wavelength. 
Gratings with the same spacing d diffract waves by the same angles 6 
whether they work by phase or amplitude modulation of the incident 
wave,!” although not with the same relative intensities in different or- 
ders. Thus there is no fundamental difference between nano-fabricated 
absorption gratings and the use of standing waves. Interferometers that 
use three standing waves in the same arrangement as in Fig. 11.3 have 
similar properties to an instrument with three gratings etched from solid 
material. The mirrors that retro-reflect the light to form the stand- 
ing waves must be mounted rigidly so that vibrations do not wash out 
the interference fringes (similarly, the nano-fabricated gratings must be 
held very stable). Standing waves of visible light give diffraction angles 
about three times less than those from the best nano-fabricated gratings; 
however, such gratings transmit all the atoms, whereas nano-fabricated 
gratings transmit much less than the 50% that might be expected for 
bars that have a width equal to the gaps between them—the very thin 
bars require an elaborate support structure of cross-bars that reduces 
the open area. Also, material gratings eventually get clogged up when 
used with alkali metals. 

The diffraction of the matter waves by a standing wave has an alter- 
native physical interpretation in terms of the scattering of light. The 
diffraction condition in eqn 11.12 can be written as tan @ ~ nhG/p (tak- 
ing tan @ ~ sin @ for small angles), where p is the longitudinal momentum 
of the atoms, G = 47/X is the characteristic wavevector of a structure 
with spacing A/2 (i.e. the grating) and AG = 2h/) equals the momentum 
of two photons. Thus, in the diffraction from the standing wave, atoms 
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receive a transverse kick from 2n photons, as illustrated in Fig. 11.5, e.g. 
first order arises from absorption from one of the counter-propagating 
beams and stimulated emission into the other beam. This coherent 
process, with no spontaneous emission, has some similarities with the 
Raman transition shown in Fig. 11.6.!° 


11.5.1 Interferometry with Raman transitions 


The description of the diffraction of two-level atoms by a standing wave, 
in the previous section, as a coherent scattering process that imparts 
twice the photon momentum 2h/A, or multiples thereof, to the atom 
has links with the very powerful method for manipulating the atom’s 
momentum by Raman transitions shown in Fig. 11.6. Two laser beams 
at frequencies wy; and wy drive a coherent Raman transition between 
states |1) and |2) when 


A (wri — WL) = E> = FE, - Chit) 


No population goes into the intermediate state |7) in this coherent transi- 
tion because neither of the two beams excites a single-photon transition 
(see Appendix E). The Raman transition couples states |1) and |2) and 
drives Rabi oscillations between them, e.g. when the atom starts in either 
|1) or |2), a 7/2-pulse creates a superposition of |1) and |2) with equal 
amplitudes. Raman laser beams propagating in opposite directions (as 
in Fig. 11.6) change the atom’s momentum during the transition.1* The 
absorption of a photon of wavevector k, and the stimulated emission of 
one in the opposite direction kz ~ —k; gives the atom two recoil kicks in 
the same direction. This process couples the state |1,p) to |2,p+ 2hk). 
The bra(c)ket notation denotes the |internal state, momentum) of the 
atoms. The Raman resonance condition in eqn 11.13 depends sensitively 
on the atom’s velocity v for counter-propagating beams and this provides 
the basis for the Raman cooling of atoms (Section 9.8). For interferome- 
try this velocity selectivity is a complicating factor and we shall assume 
that the Raman pulses are sufficiently short! to drive transitions over 
the whole range of velocity components along the laser beam. 

Figure 11.7 shows a complete Raman interferometer where the atoms 
start in |1,p) and travel through three Raman interaction regions. In 


|2, p + 2hk) 


|1, p) 


13 scattering in a standing wave 
changes the atomic momentum (ex- 
ternal state) but not the internal 
State. 


14 aman laser beams travelling in the 
same direction have the same effect as 
direct coupling between |1) and |2) by 
microwaves. 


19 According to the condition in 
eqn 9.58. 


Fig. 11.6 A Raman transition with 
two laser beams of frequencies wy, and 
wy2 that propagate in opposite direc- 
tions. Equation 11.13 gives the res- 
onance condition, ignoring the effects 
of the atom’s motion (Doppler shift). 
The Raman process couples |1,p) and 
|2,p + 2hk) so that an atom in a Ra- 
man interferometer has a wavefunction 
of the form wy = All, p) + B|2,p + 2hk) 
(usually with either B = 0 or A = 0 
initially). 
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Fig. 11.7 An interferometer formed by Raman transitions. As atoms traverse the three interaction regions they experience a 
n /2-n~n /2 sequence of Raman pulses that split, deflect and recombine the atomic wavepackets. Each interaction region has two 
counter-propagating beams at frequencies wy, and wy 2, as in Fig. 11.6. In this Mach -Zehnder interferometer the eigenstates 
with transverse momentum p and p + 2hk are associated with the different internal atomic states |1) and |2), respectively; 
as indicated at the bottom of the figure, |1) is associated with p ~ 0 (horizontal line) and |2) with p ~ 2hk (slanted line). 
Therefore at the output it is only necessary to measure the internal state of the atoms, e.g. by exciting a transition from |2) and 
detecting the fluorescence, rather than allowing beams with different momenta to become spatially separated, as in Fig. 11.3. 
(The separation of the paths has been exaggerated for clarity.) 


16 This phase, and the other similar 
phases that arise at each interaction, 
lead to an offset in the final output 
which is not important. But these 
phases must remain constant in time, 
otherwise the interference ‘washes out’. 


Mina three-grating interferometer only 
a fraction of the amplitude goes in the 
required direction. Note, however, that 
the simple treatment of standing waves 
assumed the ‘thin’ grating approxima- 
tion, but often the interaction between 
the matter waves and light takes place 
over a sufficiently long distance that 
Bragg diffraction occurs (as in crys- 
tals). 


the first interaction the atom experiences a 7/2-pulse that puts it into 
the superposition 


(wb) = {|1,p) + el® 2, p + 2hk)}. (11.14) 


The phase factor depends on the relative phase of the two laser beams.!® 
These two states separate, as shown in Fig. 11.7, and this first region cor- 
responds to a beam splitter for matter waves. After a free-flight through 
a distance L the atom enters the middle interaction region where it un- 
dergoes a z-pulse that acts on both arms of the interferometer to swap 
the states |1,p) © |2,p+ 2hk). (In this apparatus the transit time for 
the atom to pass through the laser beams determines the duration of 
the Raman interaction.) The paths come back together after a further 
distance L and the final 7/2-pulse acts as the beam splitter that mixes 
the wavepackets to give interference. The complete 7/2—n—m/2 sequence 
gives a Mach-Zehnder interferometer. A comparison of Figs 11.3 and 
11.7 shows that the Raman scheme resembles the Mach—Zehnder inter- 
ferometer more closely than the three-grating device; the Raman scheme 
does not direct any amplitude in unwanted directions and the middle in- 
teraction region in the Raman interferometer acts just like a mirror to 
change the direction (transverse momentum) of both paths through a 
small angle.!” 

A Raman pulse and standing light wave give the same opening an- 
gle between the arms for a given wavelength of laser light and both 


schemes use light whose frequency is detuned from the atomic transition 
to avoid spontaneous emission. A crucial difference between these meth- 
ods arises in the detection. The three-grating apparatus, with standing 
waves or nano-fabricated structures, distinguishes the two outputs by 
their different directions. Therefore the three-grating devices require a 
highly-collimated atomic beam at the input whose angular divergence is 
less than the angle between the two output directions Ogi¢. The output 
channels of the Raman scheme are the two different states |1,p) and 
|2,p+ 2hk), as shown in Fig. 11.7; thus experiments only need to de- 
termine the final state of the atom, e.g. using a laser beam that excites 
a transition from |2) to another state that gives fluorescence for atoms 
in state |2) but not for those in |1).!° This means that Raman inter- 
ferometers use more of the atoms from a given source because they do 
not need to have tight collimation.1? Although the flux of atoms does 
not affect the size of the phase shift given by eqn 11.10, the strength 
of the measured signal determines how precisely that phase shift can be 
measured, i.e. the interferometer measures a smaller fraction of a fringe 
if the signal-to-noise ratio is higher.2° Thus the type of Raman inter- 
ferometer shown in Fig. 11.7 measures rotation more precisely than a 
three-grating device. 


11.6 Conclusions 


Matter-wave interferometers for atoms are a modern use of the old idea 
of wave—particle duality and in recent years these devices have achieved 
a precision comparable to the best optical instruments for measuring 
rotation and gravitational acceleration. We have seen examples of ex- 
periments that are direct analogues of those carried out with light, and 
also the Raman technique for manipulating the momentum of atoms 
through their interaction with laser light, as in laser cooling. Laser 
cooling of the atom’s longitudinal velocity, however, only gives an ad- 
vantage in certain cases (see the section on further reading).”1 Similarly, 
the high-coherence beams, or atom lasers, made from Bose condensates 
do not necessarily improve matter-wave devices—in contrast to the al- 
most universal use of lasers in optical interferometers. Partly, this arises 
because of the interactions between the atoms themselves, as discussed 
in the derivation of the nonlinear Schrodinger equation in Chapter 10, 
which lead to phase shifts that depend on the atomic density. So far 
interferometry experiments that use BEC have been performed to find 
out more about the condensate itself, rather than as instruments for 
precision measurement of physical quantities. The interaction of atoms 
with the periodic potential produced by a standing wave gives a lot of 
interesting physics, in addition to the diffraction described here, and we 
have only scratched the surface of atom optics. 
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1845 in the atomic fountain described 
in Section 9.9. 


19The creation of the two Raman 
beams with a well-defined frequency 
difference, and other technical details, 
are described in Section 9.8. 


20 This argument assumes that it is 
purely statistical fluctuations (noise) 
that limit the precision, not systematic 
shifts. 


21 The Ramsey fringes produced by 
atomic fountain clocks arise from in- 
terference of the internal (hyperfine) 
states of atoms, but in this chapter the 
‘atom interferometer’ has been reserved 
for cases where there is spatial separa- 
tion between the two arms. 
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Further reading 


22-Phis journal is a useful source of sim- The review in Contemporary physics?” by Godun et al. (2001) surveys 

ilar articles. the field of atom interferometry at a level suitable for undergraduates, 
including important applications, such as the precision measurement of 
gravitational acceleration g, that have not been included here. The 
monograph Atom interferometry edited by Berman (1997) is a rich 
source of information on this subject. 
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Exercises 


(11.1) Comparison of double- and multiple-slit (11.3) Measurement of the van der Waals interaction 


diffraction 


(a) Explain in simple physical terms why the 
diffraction orders of a grating occur at the 
same angles as the constructive interference 
between a pair of slits with the same spacing 
as those in the grating. 


(b) Monochromatic light passes through a trans- 
mission diffraction grating. Initially most of 
the grating is covered with opaque sheets of 
material so that the light illuminates only two 
adjacent slits in the middle of the grating. 
The grating is gradually uncovered until fi- 
nally light falls on the whole grating. Describe 
how the intensity, spacing and shape of the ob- 
served far-field diffraction changes? 


(11.2) Young’s slits with atoms 


(a) Calculate Agp for metastable helium atoms 
from a source at 80K. 


(b) Find the source slit width ws such that the 
diffracted wave spreads out to coherently illu- 
minate two slits separated by d = 8 yum when 
L’ = 0.6m in Fig. 11.1 (the conditions for the 
experiment of Carnal and Mlynek (1991)). 


23 Brooker (2003). 
?4Based on the experiment of Grisenti et al. (1999). 


with a nano-fabricated grating 
Diffraction by a grating with slits of width a and 
spacing d gives an intensity distribution of?* 


Here u = 27sin6/Aap and the angle is defined in 
Fig. 11.1. All of the parts of this exercise refer to 
a grating with d = 2a = 100 nm. 


(a) Sketch the intensity distribution for 0 < u < 
107. 


(b) What is the intensity of the second order? 


(c) An experimental observation of the diffraction 
of rare gas atoms from the grating found that 
the intensity of the second order is 0.003 Jo for 
helium and 0.05 Jo for krypton. The differ- 
ence in these values was ascribed to the van 
der Waals force, which is strongest for large 
atoms. Therefore a krypton atom on a trajec- 
tory that goes close to the sides of the slit feels 
a force that deflects it through a large angle or 
causes it to crash into the grating. These pro- 
cesses effectively reduce the slit width from a 
to a—2r, where r is the typical van der Waals 
range. Estimate r for krypton atoms.”* 


lon traps 


This chapter describes the principal methods of building ion traps and 
a few of their many applications in physics. The examples illustrate 
the extremely high-resolution spectroscopy possible with microwave and 
laser radiation. The theme of precision measurement in an environment 
with very few perturbations continues in the next chapter on quantum 
computing—an application that has stimulated a new wave of research 
on trapped ions. 


12.1 The force on ions in an electric field 


Charged particles in electromagnetic fields experience much larger forces 
than neutral atoms. An ion with a single charge e = 1.6 x 10-!9C in an 
electric field of 10° Vm~! experiences a force 

Fion =CE = 10 “N. (12.1) 
This electric field corresponds to 500V between electrodes which are 
5mm apart.' In comparison, a neutral atom with a magnetic moment of 
one Bohr magneton in a magnetic field gradient? of dB/dz = 10T m7! 
experiences a force of magnitude 


Pearl = Lp (12:2) 


dB 
=| a0 IN, 
= : 


Ions feel a force 10° times greater than magnetically-trapped neutrals. 
We also see this large difference in a comparison of trap depths. In a 
trap operating with a voltage of Vo = 500 V, singly-charged ions have 
a maximum ‘binding’ energy of order 500eV.? This trap depth corre- 
sponds to the kinetic energy at a temperature of 6 x 10° K. This is more 
than enough to trap ions, even if the ions do receive a large recoil kick 
during the ionization process—to load an ion trap experimenters send 
a weak (neutral) atomic beam through the trapping region where an 
electron beam ionizes a few of the atoms by knocking an electron off. 
These ions created by electron bombardment have much greater kinetic 
energy than the thermal energy of atoms at room temperature (equiv- 
alent to only 1/40eV). It would be unwise to try to be more precise 
about the typical energy of an ion since it depends on the voltage used. 
In contrast, a magnetic trap for neutrals has a maximum depth of only 
0.07 K. This was estimated in Section 10.1 by taking the magnetic en- 
ergy upB for B = 0.1T, e.g. the force in eqn 12.2 over a distance of 
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'This assumes electrodes in the form 
of a parallel-plate capacitor. Although 
ion traps have a different geometry, this 
still gives a reasonable estimate and 
shows that the electrostatic force gives 
strong trapping for a voltage readily 
available in the laboratory. 


? This value is typical of magnetic traps 
with coils wound with copper wire. Su- 
perconducting magnets give higher gra- 
dients. 


3We consider ions with a single posi- 
tive charge +e such as Mgt, Cat and 
Hg*, since few experiments use species 
that acquire additional electrons to give 
negative ions. Section 12.8 deals with 
highly-charged ions. 
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4The theorem dates back to the 
nineteenth century and James Clerk 
Maxwell discussed it in his famous trea- 
tise on electromagnetism. 


5The derivation of this equation from 
the Maxwell equation div D = Pfree as- 
sumeS (free = 0 and a linear isotropic 
homogeneous medium in which D = 
€r€oE with e, constant. Ions are usually 
trapped in a vacuum, where e; = 1. 


®It takes just a few lines of algebra to 
prove this from Laplace’s equation. 


10mm. These estimates show that neutral atoms must be cooled before 
trapping but ion trapping requires only moderate electric fields to cap- 
ture the charged particles directly. It is not straightforward, however, | 
to find a suitable electric field configuration and, as in many advances — 
within atomic physics, the success of ion trapping relies on some subtle — 
ideas rather than a brute-force approach. 


12.2 Earnshaw’s theorem 


Earnshaw proved that: A charge acted on by electrostatic forces cannot 
rest in stable equilibrium in an electric field. 

Thus it is not possible to confine an ion using a purely electrostatic 
field. Physicists have invented ingenious ways around this theorem but, 
before describing the principles of these ion traps, we need to think 
about the underlying physics. The theorem follows from the fact that 
an electric field has no divergence in a region with no free charge density, 
div E = 0.° Zero divergence means that all the field lines going into a 
volume element must come out—there are no sources or sinks of field 
within the volume. Equivalently, Gauss’ theorem tells us that the inte- 
gral of the normal component of E over the bounding surface equals the 
volume integral of div E, which is zero: 


peas= [ff dive dr =o. 


Hence E.- dS cannot have the same sign over all of the surface. Where 
E-dS < 0 the electric field points inwards and a positive ion feels a 
force that pushes it back into the volume; but E-dS > 0 somewhere 
else on the surface and the ion escapes along that direction. A specific 
example of this is shown in Fig. 12.1 for the field produced by two equal 
positive charges with a fixed separation along the z-axis. Midway be- 
tween the charges, at the point labelled P, the electric fields from the 
two charges cancel and the ion experiences no force, but this does not 
give stable equilibrium. The argument above holds true, so the electric 
field lines around the point P cannot all be directed inwards. When 
slightly displaced from P, a positive ion accelerates perpendicular to the 
axis, whereas a negative ion would be attracted towards one of the fixed 
charges. This behaviour can also be explained by the fact that the point 
P is a saddle point of the electrostatic potential ¢. The electrostatic 
potential energy e@ of the ion has the same form as the gravitational 
potential energy of a ball placed on the saddle-shaped surface shown in 
Fig. 12.2—clearly the ball tends to roll off down the sides. In this alter- 
native way of looking at Earnshaw’s theorem in terms of electrostatic 
potential rather than the fields, stable trapping does not occur because 
the potential never has a minimum, or maximum, in free space.® 


(12.3) 


A Rotation 


i2.o Lhe Paul trap 


The analogy with a ball moving on the saddle-shaped surface shown in 
Fig. 12.2 provides a good way of understanding the method for confining 
ions invented by Wolfgang Paul. The gravitational potential energy of 
the ball on the surface has the same form as the potential energy of an 
ion close to a saddle point of the electrostatic potential. We assume 
here a symmetric saddle whose curvature has the same magnitude, but 
opposite sign, along the principal axes: 


(12.4) 
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where x’ = rcosM# and y’ = rsin{Mt are coordinates in a frame rotating 
with respect to the laboratory frame of reference. The time average of 
this potential is zero. Rotation of the saddle shape around the vertical 
axis turns the unstable situation into stable mechanical equilibrium, and 
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Fig. 12.1 The electric field lines be- 
tween two equal positive charges. Mid- 
way between the charges at the point P 
the electric fields from the two charges 
cancel. At this position the ion expe- 
riences no force but it is not in stable 
equilibrium. At all other positions the 
resultant electric field accelerates the 
ion. 


Fig. 12.2 A ball on a saddle-shaped 
surface has a gravitational potential en- 
ergy that resembles the electrostatic 
potential energy of an ion in a Paul 
trap. Rotation of the surface about a 
vertical axis, at a suitable speed, pre- 
vents the ball rolling off the sides of the 
saddle and gives stable confinement. 
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7 Although we shall not analyse the me- 
chanical system in detail, it is impor- 
tant to note that the wobbling motion 
is not entirely up and down but has ra- 
dial and tangential components. Simi- 
larly, an object that floats on the sur- 
face of water waves does not just bob up 
and down but also oscillates back and 
forth along the direction of propagation 
of the waves, so its overall motion in 
space is elliptical. The discussion only 
applies for mechanical systems where 
friction has a negligible effect so that 
the ball slides smoothly over the sur- 
face. 


8The a.c. field does not change the 
ion’s average total energy because the 
work done on the ion given by F-r « 
cos(Mt) sin(Nt) averages to zero over 
one cycle—the force and velocity have 
a phase difference of 71/2. 


makes an impressive lecture demonstration. Such dynamic stabilisation 
cannot honestly be described as ‘well known’ so it is explained carefully 
here by an approximate mathematical treatment of ions in an a.c. field. 


12.3.1 Equilibrium of a ball on a rotating saddle 


In the mechanical analogue, the rotation of the saddle at a suitable speed 
causes the ball to undergo a wobbling motion; the ball rides up and down 
over the low-friction surface of the rotating saddle shape and the ball’s 
mean position only changes by a small amount during each rotation.” 
The amplitude of this wobbling increases as the ball moves further from 
the centre of the saddle. For this oscillatory motion the time-averaged 
potential energy is not zero and the total energy (potential plus kinetic) 
increases as the object moves away from the centre. Therefore the mean 
position of the object (averaged over many cycles of the rotation) moves 
as if it is in an effective potential that keeps the ball near the centre of 
the saddle. We will find that an ion jiggling about in an a.c. field has a 
similar behaviour: a fast oscillation at a frequency close to that of the 
applied field and a slower change of its mean position. 


12.3.2 The effective potential in an a.c. field 


To explain the operation of the Paul trap, we first look at how an ion 
behaves in a.c. electric field E = Ep cos(Qt). An ion of charge e and 
mass M feels a force F = eEo cos(Qt), and so Newton’s second law gives 


Mr = eEo cos(Mt) . (12:5) 


Two successive integrations give the velocity and displacement as 


(12.6) 


It has been assumed that the initial velocity is zero and ro is a constant 
of integration. The field causes the ion to oscillate at angular frequency 
Q with an amplitude proportional to the electric field. From this steady- 
state solution we see that the forced oscillation does not heat the ions.® 
(These simple steps form the first part of the well-known derivation of 
the plasma frequency for a cloud of electrons in an a.c. field, given in most 
electromagnetism texts.) The following section describes an example of 
this behaviour in which the amplitude of the electric field changes with 
position Eo(r). 


12.3.3 The linear Paul trap 


In a linear Paul trap the ion moves in the field produced by the electrodes 
shown in Fig. 12.3. The four rods lie parallel to the z-axis and at the 
corners of a square in the zy-plane. Each electrode is connected to the 
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Fig. 12.3 A linear Paul trap used to store a string of ions. (a) A view looking along the four rods with the end-cap electrode 
and ions in the centre. Each of the rods is connected to the one diagonally opposite so that a voltage between the pairs gives 
a quadrupole field. (b) A side view of the rod and end-cap electrodes which have a.c. and positive d.c. voltages, respectively. 


A string of trapped ions is indicated. 


one diagonally opposite and the a.c. voltage V = Vo cos (Mt) is applied 
between the two pairs. Despite the fact that the voltages vary with time, 
we first find the potential by the usual method for electrostatic prob- 
lems. The electrostatic potential ¢ satisfies Laplace’s equation V7¢ = 0 
(because div E = 0 and E = —V¢@). A suitable solution for the potential 
close to the z-axis, that matches the symmetry of the voltages on the 
electrodes, has the form of a quadrupole potential 


b = do + a2(x* — y”). (12.7) 


The coefficients a@9 and a2 are determined from the boundary conditions. 
There are no terms linear in x or y because of the symmetry under 
reflection in x = 0 and y = 0. The terms in x? and y? have opposite 
signs, and the variation with z is negligible for rods much longer than 
their separation 2r9. The potential must match the boundary conditions 


b= bo + 2 cos (Mt) ee ere 


V, (12.8) 
b= do — | co (Mt) at, z=0, y=+ro. 
These conditions are satisfied by the potential? 
VY 
¢=¢o+ ae cos (Nt) (2? — y”). (12.9) 
0 


To solve Laplace’s equation we have simply made a reasonable guess, 
taking into account the symmetry. This is perfectly justified since the 
uniqueness theorem says that a solution that fits the boundary condi- 
tions is the only valid solution (see electromagnetism texts). The usual 


° This ignores the finite size of the elec- 
trodes and that the inner surfaces of the 
electrodes would need to be hyperbolic, 
e.g. a surface given by x? —y? = const., 
to match the equipotentials. However, 
by symmetry this potential has the cor- 
rect form for r < rg, no matter what 
happens near to the electrodes. 
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10-This method would not be appropri- 
ate for shorter wavelengths, e.g. micro- 
wave radiation with frequencies of GHz. 


‘lhe two-dimensional quadrupole 
field between the four rods looks 
superficially like the quadrupole field 
with cylindrical symmetry in Fig. 12.1 
and the analogy with a rotating saddle 
applies to both. A comparison of 
the potentials for the two cases in 
eqn 12.9 and eqn 12.23 shows that 
they are different. Note also that the 
electrostatic potential oscillates ‘up 
and down’ rather than rotating as in 
the mechanical analogy. 


12-The Mathieu equation arises in a va- 
riety of other physical problems, e.g. 
the inverted pendulum. A pendulum is 
normally considered as hanging down 
from its pivot point and undergoing 
simple harmonic motion with a small 
amplitude. In an inverted pendulum 
a rod, that is pivoted at one end, ini- 
tially points vertically upwards; any 
slight displacement from this unsta- 
ble position causes the rod to fall and 
swing about the stable equilibrium po- 
sition (pointing straight down), but if 
the pivot point oscillates rapidly up 
and down then the rod remains up- 
right whilst executing a complicated 
motion—the rod can make quite large- 
angle excursions from the vertical direc- 
tion without falling over. The math- 
ematical textbook by Acheson (1997) 
gives further details of the complexities 
of this fascinating system and numeri- 
cal simulations can be seen on the web 
site associated with that book. 


I3-This corresponds to the motion of an 
ion in a trap that has no d.c. voltage. In 
practice, ion traps may have some d.c. 
voltage because of stray electric fields 
but this can be cancelled by applying 
a suitable d.c. voltage to the electrodes 
(or additional electrodes near the four 
rods). The solution of the Mathieu 
equation with az 4 0 is discussed in 
the book on ion traps by Ghosh (1995). 


14\fore detailed mathematical treat- 
ments of the Mathieu equation can be 
found in Morse and Feshbach (1953) 
and Mathews and Walker (1964). 


method for solving an electrostatic problem applies even though the volt- 
age on the electrodes changes because at the radio-frequencies (used in 
ion traps) the radiation has a wavelength much greater than the dimen- 
sions of the electrodes, e.g. a wavelength.of 30m for 2 = 27 x 10 MHz."° 
The potential energy e¢ of an ion has a saddle point in the middle of 
these electrodes that looks like the saddle shape shown in Fig. 12.2—a 
potential ‘hill’ in the z-direction and a ‘valley’ in the y-direction, or the 
other way around.!! From the gradient of potential we find the electric 
field 


E = Eo (r) cos (Mt) 


12.10 
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The equation of motion in the x-direction is 


d?z eVo 
oo bce 1240 
7 7 cos (Qt) x ( ) 
A change of variable to 7 = Nt/2 leads to 
d?z 4eVo 
Aue) = ~ Mrz Cos (27) 2. (12:12) 
This is a simplified form of the Mathieu equation:1” 
2 
SEGe AMI Ps ip (12.13) 
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with a, = 0.'° It is conventional to define the parameter in front of the 
oscillating term as 2q, (in anticipation of this e has been used for the 


ion’s charge), where 
2eVo 


We look for a solution of the form 
x = a9 cos Ar {1+ Bcos2r}. (12.15) 


The arbitrary constant A gives the angular frequency of the overall mo- 
tion and B is the amplitude of the fast oscillation at close to the driving 
frequency. The justification for choosing this form is that we expect an 
oscillating driving term to produce a periodic solution and substitution 
of a function containing cos At into the equation leads to terms with 
cos Ar cos 27.'* Substitution into the equation (with a, = 0) gives 


ato [—4B cos Ar cos 27 + 4AB sin At sin 27 — A? cos At {1 + B cos 27}] 


= 2q¢7X9 cos 27 cos Ar {1+ Bcos2r}. 
(12.16) 


We shall assume that A < 1, so that the function cos At corresponds 
to a much slower oscillation than cos 27, and also that the amplitude 
B < 1 (both of these assumptions are discussed below). Thus the terms 


proportional to cos At cos 27 dominate on each side, and equating their 
coefficients gives —4B = 2q, or 


(12.17) 


This amplitude of the fast oscillation is consistent with the result for 
a uniform electric field in eqn 12.6.'° This fast oscillation is called the 
micromotion. To determine the angular frequency A we consider how the 
mean displacement changes on a time-scale longer than the micromotion: 
the time-average of cos? 2r = 1/2 so eqn 12.16 yields —A? cos AT = 
gxB cos At;!® hence A = q;/V2 and an approximate solution is 


L = Xo COS (25 + 6 {1 + a cos ar} : (12.18) 


V2 
where gq, is defined in eqn 12.14.'" We assumed that g, <1 but it turns 
out that this approximation works better than 1% for q, < 0.4 (Wuerker 
et al. 1959). Since r = Nt/2 the mean displacement undergoes simple 
harmonic motion at an angular frequency given by 


qa eVo 
ey = = ———... WAAAY 
"2/2 /20Mr? cg 
A more detailed treatment shows that ions remain trapped for 
dz < 0.9 (12.20) 


or WwW; < 0.32. For a radio-frequency field oscillating at Q = 27 x 10 MHz 
the ion must have a radial oscillation frequency w, < 27 x 3 MHz. If we 
choose w, = 27 x 1 MHz (a convenient round number) then eqn 12.19 
gives the numerical values Vo = 500 V and ro = 1.9mm for trapping Mgt 
ions.'8 To get this high trapping frequency the ion trap has electrodes 
closer together than we assumed in the introduction. By symmetry, 
the same considerations apply for motion in the y-direction, and so we 
define a radial frequency w, = wz = Wy. The Paul trap has a sharp 
transition from stable trapping to no trapping in the radial direction 
when q, equals the maximum value of gz, in eqn 12.20. Paul used this 
feature to determine the charge-to-mass ratio e/M of the ions and hence 
perform mass spectroscopy —generally the charge state is known (e.g. it 
is e or 2e, etc.) and hence M is determined. 

So far we have only described confinement in the xy-plane. There 
are several ways to extend trapping to all three directions. For example, 
Fig. 12.3(b) shows a trap with two additional electrodes at z = +2 that 
repel the ions. For positive ions both of these end-cap electrodes have the 
same positive voltage to give a field similar to that shown in Fig. 12.1, 
with a minimum in the electrostatic potential along the z-axis—in the 
radial direction the static potential has a negligible effect compared to 
the a.c. trapping. When the linear Paul trap has axial confinement 
weaker than that in the radial direction, i.e. wz < wy = wy, the ions 
tend to lie in a string along the z-axis with only a small micromotion 
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15 Pquation 12.10 shows that the com- 
ponent of the electric field in this direc- 
tion is Eo (r) -é, = —Vox/re. 


16The term 4ABsin Arcos2r_ time- 
averages to zero. 


Tan arbitrary initial phase 69 has been 
included to make the expression more 
general but this does not affect the ar- 
gument above. 


181 comparison, neutral atoms in mag- 
netic traps oscillate at frequencies in 
the range 10-1000 Hz. 
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19See Berkeland et al. (1998) for a 
description of a state-of-the-art laser- 
cooled mercury ion frequency standard. 


in the radial direction; there is no axial micromotion because it is a d.c. 
electric field along the z-direction. We will see that after cooling the 
ions sit very close to the trap axis where the a.c. field causes very small 
perturbations. In the later discussion of laser cooling the trapped ions 
are considered simply as harmonic oscillators (neglecting micromotion). 


12.4 Buffer gas cooling 


Trapping of ions requires a vacuum, but the presence of a small back- 
ground of helium gas at a pressure (~ 10~* mbar) gives very effective 
cooling of hot ions. The ions dissipate their kinetic energy through col- 
lisions with the buffer gas atoms and this quickly brings the ions into 
thermal equilibrium with the gas at room temperature. For ions that — 
start off above room temperature the buffer gas cooling actually reduces 
the perturbations on the ions. Any slight broadening and shift of the 
ions’ energy levels by the collisions is outweighed by the reduction in the 
ions’ micromotion—the ions stay closer to the trap centre where they 
see smaller a.c. fields. 

Buffer gas cooling can be compared to having a vacuum flask that has 
partially ‘lost its vacuum’—hot coffee in the flask cools down because 
the low pressure of gas between the walls provides much less thermal 
insulation than a good vacuum. It follows from this argument that 
ions can only be cooled far below room temperature in a good vacuum, 
e.g. a pressure of 10~!! mbar in the laser cooling experiments described 
later (otherwise collisions with the hot background gas atoms heat the 
ions). This case corresponds to having a good vacuum flask; dewars in 
the laboratory work on the same principle to keep things such as liquid 
helium at temperatures much colder than the surroundings. 

Buffer gas cooling finds widespread application in instruments that 
need to operate reliably over long periods, such as the mercury ion clock 
developed at NASA’s Jet Propulsion Laboratory in Pasadena, Califor- 
nia. The linear Paul trap contains a cloud of mercury ions and mi- 
crowaves drive the transition at 40 GHz between the two hyperfine levels 
in the ground state of the ions. By reference to the resonance frequency 
of the ions, the electronic servo-control system maintains the frequency 
of the microwave source stable to 1 part in 10!* over long periods. This 
ion trap provides a very good frequency reference and has been used 
for navigation in deep space, where the accurate timing of signals trans- 
mitted to and from the probe determine its distance from a transmit- 
ter/receiver on Earth.’ Paul traps with buffer gas cooling are also used 
to give long ion storage times in some commercial mass spectrometers. 

There is a method analogous to buffer gas cooling that works at much 
lower temperatures. In sympathetic cooling a trap confines two species 
of ions at the same time, e.g. Bet and Mg*. Laser cooling one species, 
as described in the next section, e.g. the Bet ions, produces a cloud 
of cold ions that acts as the ‘buffer gas’ to cool the Mg* ions through 
collisions. The ions interact through their strong long-range Coulomb 
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repulsion, so they never come close enough together to react—in this 
sense the collisions between ions are benign (whereas neutral atoms in 
magnetic traps undergo some inelastic collisions leading to trap loss). 


12.5 Laser cooling of trapped ions 


The cooling of ions uses the same scattering force as the laser cooling 
of neutral atoms, and historically David Wineland and Hans Dehmelt 
proposed the idea of laser cooling for ions before any of the work on 
neutral atoms. The long confinement time in ion traps makes experi- 
ments straightforward in principle; but, in practice, ions have resonance 
lines in the blue or ultraviolet regions, so they often require more com- 
plicated laser systems than for neutral atoms with resonance lines at 
longer wavelengths.?° This difference arises because in ions the valence 
electron sees a more highly-charged core than in the isoelectronic neutral 
atoms, i.e. the atom with the same electronic configuration. The shorter 
wavelengths for ions also means that they generally have larger natural 
widths than neutral atoms since I depends on the cube of the transition 
frequency. This high scattering rate for resonant laser light leads to a 
strong radiative force on the ions and also allows the detection of single 
ions, as shown below. 

Each trapped ion behaves as a three-dimensional simple harmonic os- 
cillator but a single laser beam damps the motion in all directions. To 
achieve this, experimenters tune the laser frequency slightly below reso- 
nance (red frequency detuning, as in the optical molasses technique), so 
the oscillating ion absorbs more photons as it moves towards the laser 
beam than when it moves away. This imbalance in the scattering dur- 
ing the oscillations slows the ion down. This Doppler cooling works in 
much the same way as in optical molasses but there is no need for a 
counter-propagating laser beam because the velocity reverses direction 
in a bound system. The imbalance in scattering arises from the Doppler 
effect so the lowest energy is the Doppler cooling limit kgT = AI/2 (see 
Exercise 12.1). To cool the ion’s motion in all three directions the ra- 
diative force must have a component along the direction corresponding 
to each degree of freedom, i.e. the laser beam does not go through the 
trap along any of the axes of symmetry. During laser cooling the sponta- 
neously emitted photons go in all directions and this strong fluorescence 
enables even single ions to be seen! Here I do not mean detectable, but 
actually seen with the naked eye; a Bat ion with a visible transition 
appears as a tiny bright dot between the electrodes when you peer into 
the vacuum system. The resonance transition in the barium ion has a 
longer wavelength than most other ions at 493 nm in the green region of 
the visible spectrum. More generally, experiments use CCD cameras to 
detect the blue or ultraviolet radiation from the ions giving pictures such 
as Fig. 12.4. To calculate the signal from a Cat ion with a transition 
at a wavelength of 397nm and [ = 27 x 23 x 10°s~!, we use eqn 9.3 with 


20The generation of continuous-wave 
radiation at 194nm for laser cooling 
Hgt requires several lasers and fre- 
quency mixing by nonlinear optics, but 
nowadays radiation at a wavelength of 
397nm for laser cooling Cat is pro- 
duced by small semiconductor diode 
lasers. 
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Fig. 12.4 A string of calcium ions in a 
linear Paul trap. The ions have an aver- 
age separation of 10 4m and the strong 
fluorescence enables each ion to be de- 
tected individually. The minimum size 
of the image for each ion is determined 
by the spatial resolution of the imaging 
system. Courtesy of Professor Andrew 
Steane and co-workers, Physics depart- 
ment, University of Oxford. 


Fig. 12.5 The fluorescence signal from 
a single calcium ion undergoing quan- 
tum jumps. The ion gives a strong sig- 
nal when it is in the ground state and 
it is ‘dark’ while the ion is shelved in 
the long-lived metastable state. Data 
courtesy of Professors Andrew Steane 
and Derek Stacey, David Lucas and co- 
workers, Physics department, Univer- 
sity of Oxford. 
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6 =—I/2 and I = 2/.3,, so that 
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Recatt ~ 7 4x 10’ photons! (12,20 


In a typical experiment the lens that images the fluorescence onto the 
detector has an f-number (ratio of focal length to diameter) of about 2, 
so it collects 1.6% of the total number of fluorescence photons (the solid 
angle subtended over 47). A reasonable detector could have an efficiency 
of 20%, giving an experimental signal of S = 0.016 x 0.2 X Recatt = 
10° photons * that can easily be measured on a photomultiplier as in 
Fig. 12.5. (The signal is lower than the estimate because fluorescent 
photons are lost by reflection at the surfaces of optical elements, e.g. 
windows or lenses.) 

Laser cooling on a strong transition reduces the ion’s energy to the 
Doppler cooling limit AP'/2. In a trap with a spacing of hwtrap between 
vibrational energy levels, the ions occupy about ~ T'/wtrap vibrational 
levels; typically this corresponds to many levels. The minimum energy 
occurs when the ions reside in the lowest quantum level of the oscillator 
where they have just the zero-point energy of the quantum harmonic 
oscillator. To reach this fundamental limit experiments use more so- 
phisticated techniques that drive transitions with narrow line widths, as 
described in Section 12.9. 


12.6 Quantum jumps 


In addition to the strong resonance transition of natural width T used 
for laser cooling, ions have many other transitions and we now consider 
excitation of a weak optical transition with a natural width I’, where 
I’ < [. Figure12.6 shows both of the transitions and the relevant 
energy levels. The first application described here simply uses a narrow 
transition to measure the temperature accurately. We shall look at high- 
resolution laser spectroscopy later. 

A simple calculation shows that probing on a narrow transition is 
necessary to measure the temperature of ions at the Doppler cooling 
limit of the strong transition. Multiplying the velocity spread in eqn 9.29 
by a factor of 2/, as in eqn 6.38, gives the Doppler-broadening at the 
Doppler cooling limit as 


2p, | AL 


For the transition in the calcium ion whose parameters were given in 
the previous section this evaluates to Afp = 2 MHz, which is only 0.07 
times the natural width (A fy = 23 MHz). Therefore the line has a width 
only slightly greater than the natural width and measurements of this 
line width cannot determine the temperature accurately. This difficulty 
disappears for a much narrower transition where Doppler broadening 
dominates the observed line width, but this scheme presents a practical 
problem—the low rate of scattering of photons on the weak transition 
makes it hard to observe the ion. 

Ion trappers have developed a clever way to simultaneously obtain 
a good signal and a narrow line width. This experimental technique 
requires radiation at two frequencies so that both a strong and weak 
transition can be excited. When both laser frequencies illuminate the 
ions the fluorescence signal looks like that shown in Fig. 12.5. The flu- 
orescence switches off and on suddenly at the times when the valence 
electron jumps up or down from the long-lived excited state—the average 
time between the switching in this random telegraph signal depends on 
the lifetime of the upper state. These are the quantum jumps between 
allowed energy levels postulated by Bohr in his model of the hydrogen 


ata A! 
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Fig. 12.6 Three energy levels of an ion. 
The allowed transition between levels 1 
and 2 gives a strong fluorescence signal 
when excited by laser light. The weak 
transition between 1 and 3 means that 
level 3 has a long lifetime and I’ < [T 
(a metastable state). 
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2lThis is an B3 transition in nota- 
tion, where E1 and E2 signify electric 
dipole and quadrupole transitions, re- 
spectively. 


22-The logic of this method is reminis- 
cent of a case solved by the famous de- 
tective Sherlock Holmes. From the ob- 
servation that the dog did not bark in 
the night he deduced that the murderer 
was not a stranger. It is deduced that 
the ion has been excited when it is not 
observed. 


atom, and the direct observation of such jumps for a single ion provides 
new ways of testing quantum mechanics. Previous experiments mea- 
sured the ensemble average of an observable, i.e. its average value for 
a collection particles, but ion traps allow repeated measurements on a 
single object. In quantum language, each absorption and emission of a 
photon on the strong transition constitutes a measurement of the state 
of the ion—it is found to be in either the ground state or the long-lived 
excited state, and these two outcomes correspond to the two eigenval- 
ues of the observable. The fluorescence signal in Fig. 12.5 represents a 
sequence of such measurements that gives a record of the state of the 
ion. In addition to giving an insight into the nature of quantum me- 
chanics, the excitation of very narrow resonances has a very practical 
use in optical frequency standards. 

The observation of a highly forbidden transition in the ytterbium ion 
at the National Physical Laboratory at Teddington, London furnishes 
an extreme example of high-resolution spectroscopy. An ?F7 /2 level in 
Yb* can only decay to the ground level ?S;/2 by an octupole transition”* 
with a calculated natural lifetime of 10 years (Roberts et al. 1997). Even 
though excited ions are forced to decay more quickly to make experi- 
ments feasible, the rate of spontaneous emission on this transition is 
tiny. To detect this weak transition experiments use the scheme shown 
in Fig. 12.6. In addition to the weak *S;/2 *F 7/2 transition between 
levels 1 and 3 there is a strong 7S, /2—*Pj/2 transition between levels 1 
and 2. Laser radiation at frequency wy drives the weak transition for 
a time ty; then laser light resonant with a strong transition (wy, ~ w42) 
turns on for a period tage to determine the state of the ion. If the ion 
was excited to the long-lived upper level during ¢,, then it does not flu- 
oresce. If it remains in the ground state, however, photons arrive at the 
detector at a rate Rops ~ 10°s~!, as estimated below eqn 12.21 (for a 
strong transition in a calcium ion). In a period tge, = 2x 1077s there are 
Robstdet = 2000 photons detected. Repetition of these two stages of the 
measurement procedure as wy, scans over the frequency range around the 
narrow resonance at w 3 produces a plot of the probability of exciting 
the narrow transition (in time ty) versus laser frequency. The line width 
of the observed resonance depends on the measurement time, since other 
broadening mechanisms are negligible. Fourier transform theory shows 
that the frequency width of a pulse is inversely proportional to its dura- 
tion, Awops ~ 1/ty—the same relation used in transit-time broadening 
(eqn 7.50). This corresponds to counting the number of cycles of the 
laser radiation, fi/ty to within +} of a cycle (ft: = wy: /27). Frequency 
standards use this detection scheme with alternate periods of probing 
and measurement because the weak transition must not be perturbed 
by having a strong interaction at the same time.2” 
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12.7 The Penning trap and the Paul trap 


For experiments with several ions the linear Paul trap has the advantage 
that the ions lie like a string of beads along the trap axis, with little mi- 
cromotion. However, most of the pioneering experiments on ion traps 
used the electrode configuration shown in Fig. 12.7. The cylindrical 
symmetry of these electrodes imposes boundary conditions on the elec- 
trostatic potential, as in Section 12.3.3. Solutions of Laplace’s equation 
that satisfy these conditions have the form 


a) 


d= do + a2 (?- ; 


(17,23) 
The surfaces of constant potential have a hyperbolic cross-section in 
any radial plane, e.g. y = 0; in many experiments the electrodes match 
the shape of these equipotentials so that the potential corresponds to 
eqn 12.23 right out to the electrode surfaces, but any cylindrically- 


(a) 
Ring 
———— 
Y 
(b) B 


Ee Fee Sie e Poe 
ABBE: y. 


Fig. 12.7 The electrode configuration 
of (a) the Paul trap and (b) the Penning 
trap, shown in cross-section. The lines 
between the end caps and ring elec- 
trode indicate the electric field lines; 
the Paul trap has an oscillating elec- 
tric field but the Penning trap has static 
electric and magnetic fields. The elec- 
trodes shown have a hyberbolic shape 
(hyperbolae rotated about the z-axis), 
but for a small cloud of ions con- 
fined near to the centre any reason- 
able shape with cylindrical symmetry 
will do. Small ion traps with di- 
mensions ~ 1mm generally have sim- 
ple electrodes with cylindrical or spher- 
ical surfaces (cf. Fig. 12.3). Courtesy of 
Michael Nasse. 
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3-The higher-order terms 24, 22y”, etc. 
have negligible influence near the ori- 
gin. 


24.5 shown in Fig. 1.6 for the classi- 
cal model of the Zeeman effect. This 
assumes zero velocity along the z-axis. 
See also Blundell (2001). 


2°See Bleaney and Bleaney (1976, 
Problem 4.10). 


symmetric electrodes work well for laser cooling experiments since the | 
cold ions remain close to the trap centre, where symmetry constrains 
the potential to have the form of eqn 12.23.73 An a.c. voltage between 
the ring electrode and the two end caps gives a Paul trap that works 
on the same principle as the two-dimensional trapping described in Sec- 
tion 12.3.3. For cylindrical symmetry the trap has an electric field gra- 
dient along z whose magnitude is twice as large as the gradient along x 
or y. This difference from the linear quadrupole field means that the re- 
gion of stable Paul trapping occurs at slightly different voltages (values 
of the parameter gq, defined in eqn 12.14), as described in the book on 
ion traps by Ghosh (1995). 

The Paul trap principle can be used to confine charged particles in 
air, e.g. dust, or charged droplets of glycerin of diameter 10-100 ym. 
In this demonstration apparatus, the strong damping of the motion by _ 
the air at atmospheric pressure helps to confine charged particles over a 
large range of a.c. voltages (see Nasse and Foot (2001), which contains 
references to earlier work). In comparison, the well-known Millikan oil 
drop experiment just levitates particles by balancing gravity with an 
electrostatic force. 


12.7.1 The Penning trap 


The Penning trap has the same electrode shape as the Paul trap (as 
shown in Fig. 12.7) but uses static fields. The Paul trap is generally 
assumed to have cylindrical symmetry unless specifically stated as being 
a linear Paul trap. In a Penning trap for positive ions, both end caps 
have the same positive voltage (with respect to the ring electrode) to 
repel the ions and prevent them escaping along the axis (cf. the trap in 
Fig. 12.3). With only a d.c. electric field the ions fly off in the radial 
direction, as expected from Earnshaw’s theorem, but a strong magnetic 
field along the z-axis confines the ions. The effect of this axial magnetic 
field can be understood by considering how a charged particle moves in 
crossed electric and magnetic fields. In an electric field E = Ee, the 
force F = eEF accelerates the positive ion along the z-axis. In a region 
where there is a uniform magnetic field B = Be, the Lorentz force 
F = ev x B causes the ion to execute circular motion at the cyclotron 
frequency”4 


a 12.24 
We = —. 
In a region of crossed electric and magnetic fields the solution of the 
equations of motion gives?° 
E 
t= a (1 — coswet), 
E : 12-2 
a Bas. (Wet — sin wet) , (12.25) 
20 


for the initial conditions f = y = z= 4 = y = 2 = 0. The trajectory 
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of the ion is a cycloid: a combination of circular motion at w, and 
drift at velocity E/B in the y-direction,”® as illustrated in Fig. 12.8(c). 
The counter-intuitive drift of the ion’s average position in the direction 
perpendicular to E is the key to the operation of the Penning trap. The 
drift of the ion perpendicular to the radial electric field gives a tangential 
component of velocity and causes the ion to move slowly around the 
z-axis (direction of B) at the magnetron frequency wm whilst at the 
same time undergoing cyclotron orbits,?" as shown in Fig. 12.8(d). The 
electrode structure shown in Fig. 12.7 gives a radial field in the plane z = 
0. In addition to we and wm, the ion’s motion has a third characteristic 
frequency w, associated with oscillations along the z-axis of the trap 
(analogous to the axial motion between the two d.c. electrodes at either 
end of the linear Paul trap). Usually the three frequencies have widely 


261t is easily seen that E/B has the di- 
mensions of a velocity by looking at the 
expressions for the electric and mag- 
netic forces given above. 


item magnetron, a beam of elec- 
trons in crossed E- and B-fields moves 
in a similar way to that shown in 
Fig. 12.8, but much faster than ions be- 
cause of the smaller mass and higher 
electric field. These electrons radiate 
electromagnetic radiation at wm in the 
microwave region, e.g. at 2.5GHz for 
the magnetrons in domestic microwave 
ovens (Bleaney and Bleaney 1976, Sec- 
tion 21.5). 


Fig. 12.8 The motion of a positively-charged ion in various configurations of electric and magnetic fields. (a) A uniform electric 
field along the z-direction accelerates the ion in that direction. (b) A uniform magnetic flux density B along the z-direction 
(out of the page) leads to a circular motion in the plane perpendicular to B, at the cyclotron frequency we. (c) In a region 
of crossed electric and magnetic fields (E, and Bz, respectively) the motion described by eqns 12.25 is drift at velocity E/B 
perpendicular to the uniform electric field in addition to the cyclotron orbits. (The ion is initially stationary.) (d) In a Penning 
trap the combination of a radial electric field and axial magnetic field causes the ion to move around in a circle at the magnetron 
frequency. (If B = 0 then the ion would move radially outwards and hit the ring electrode.) 
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different values w, < Wm < we (Exercise 12.4). 


12.7.2 Mass spectroscopy of ions 


The determination of the mass of ions by means of eqn 12.20 for Paul 
trapping has been mentioned earlier. Alternatively, measurement of the 
ratio of cyclotron frequencies of two different species of ion in the same 
Penning trap gives their mass ratio: 

w,  eB/M' M 


Cc 


we  eB/M  M’° 


(12.26) 


This assumes the simplest case with two species of equal charge, but the 
ratio of the charges is always known exactly. Superconducting magnets 
give very stable fields so that the cancellation of B in the above equa- 
tion introduces very little uncertainty and in this way masses can be 
compared to better than 1 part in 10°. 


12.7.3. The anomalous magnetic moment of the 
electron 


The advantages of the Penning trap have been exploited to make precise 
measurements of the magnetic moment of the electron (confined in the 
same way as ions but with a negative voltage on the end caps). From 
an atomic physics perspective, this can be viewed as a measurement of 
the Zeeman effect for an electron bound in a trap rather than one bound 
in an atom (Dehmelt 1990), but the splitting between the two magnetic 
states m, = +1/2 is the same in both situations, corresponding to a 
frequency Aw = g,upB/h = g,eB/2m-.. Measurement of this frequency 
gives the gyromagnetic ratio for spin g;. To determine B accurately 
they measured the cyclotron frequency w, = eB/me and found the ratio 


Aw Qs 


. 12. 
= 5 (12.27) 
The relativistic theory of quantum mechanics developed by Dirac pre- 
dicts that g, should be exactly equal to 2, but the incredibly precise 
measurement by Van Dyck et al. (1986) found 


= = 1.0011596521884(4) . 


The accuracy is better than 4 in 10'*. Often this is quoted as a mea- 
surement of g — 2 for the electron and the difference from 2 arises from 
quantum electrodynamic (QED) effects. For the electron the theoretical 
calculation gives 

Ys a a? a\3 a\4 

F=1+ 5 +4o(S) +40(=) +4. (FS) +... (22.28) 

?, 27 T T T 
The very detailed calculations give the coefficients Az = —0.328478965, 
A3 = 1.17611 and A, = —0.99. The numerical value of this expression 


agrees well with the experimental value given above and this provides 
a very stringent test of the theory.?° Actually, at the time of the first 
measurements the fine-structure constant a was not known to enough 
decimal places, so the argument was turned around—the theory was 
assumed to be correct and used to deduce the value of a. Writing the 
theoretical value as coefficients multiplying various powers of a (in this 
case a/7) reflects the way theorists carry out QED calculations. Each 
power corresponds to perturbations of a given order. To match the 
accuracy of the experiment required evaluation of the contribution from 
all orders of perturbation up to and including the fourth order.?? As 
might be imagined, it took many years of careful work to match the 
phenomenal precision of the experiment. 

Similar experiments have also been carried out for the positron, the 
antimatter counterpart of the electron, and the comparison of the prop- 
erties of particles and antiparticles gives interesting tests of the funda- 
mental symmetry principles of particle physics. The very accurate theo- 
retical calculations of the magnetic moments can only be made for simple 
particles without internal structure (leptons). Other QED experiments 
provide complementary information; for example, the measurements of 
the Lamb shift in hydrogen, and highly-ionized hydrogenic ions test the 
theory for an electron in a bound state where the calculations are con- 
siderably more complicated than for a free electron. It is very important 
to understand how to apply field theories like QED to bound systems. 


12.8 Electron beam ion trap 


The electron beam ion trap (EBIT) was developed to confine ions that 
have lost many electrons and which have energies much higher than 
those in typical experiments with Paul and Penning traps. 

Figure 12.9 shows a schematic layout of an EBIT. Such a device is 
physically much larger than the other types of trap, but still much 
smaller than the particle accelerators that were previously used to pro- 
duce highly-ionized ions.2° The EBIT confines positive ions by their 
strong electrostatic attraction to the high negative charge density in an 
electron beam along the axis of the trap—the ions stay within this elec- 
tron beam most of the time. The electrons emanate from an electron 
gun with a high current density over a small area, but the space charge 
in the beam tends to cause divergence. A strong axial magnetic field 
counteracts this spreading to keep the electrons tightly focused. This 
magnetic field acts as in a Penning trap to prevent the electrons mov- 
ing radially outwards under the influence of the radial electric field—the 
same field that confines positive ions pushes electrons outwards.*! Elec- 
trodes with a d.c. voltage of several kilovolts restrict the motion of the 
ions along the electron beam. (In comparison, the Penning trap only 
has a few volts on the end caps.) 

In addition to trapping, the EBIT also produces the ions by the fol- 
lowing ionization steps.?” Atoms or ions in a low-charge state injected 
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28in 1998 CODATA recommended the 
value of a = 1/137.0359979. The latest 
value may be found on the NIST web 
site. 


2° This required the evaluation of about 
1000 contributions, each represented by 
different Feynman diagrams, and the 
calculations are still being refined by 
Kinoshita (1995) and Kinoshita and 
Nio (2003). 


30The EBIT uses a hybrid of the tech- 
niques in ion trapping and accelerator 
physics to extend the precision of traps 
to higher energies. 


31-The magnetic field has a negligible 
effect directly on the ions, in compari- 
son to the electrostatic force from the 
space charge of the electrons. 


321 has the alternative name of an elec- 
tron beam ion source or EBIS. 
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Fig. 12.9 A cross-section of an electron 
beam ion trap that has cylindrical sym- 
metry. The high-energy electron beam 
along the axis of the trap (that is obvi- 
ously a negative space charge) attracts 
positive ions to give radial confinement 
and ionizes them further. The elec- 
trodes (called drift tubes by accelerator 
physicists) give confinement along the 
axis, i.e. the top and bottom drift tubes 
act like end caps as in a Penning trap 
but with much higher positive voltage. 
To the right is an enlarged view of the 
ions in the electron beam and the form 
of the electrostatic potentials along the 
radial and axial directions. Courtesy of 
Professor Joshua Silver and co-workers, 
Physics department, University of Ox- 
ford. 
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into the EBIT region have electrons knocked off by the electron beam 
to form positive ions. These ions become confined within the electron 
beam where bombardment by the high-energy electrons removes more 
and more electrons, so that the ions become more highly charged. This 
process continues until the electrons remaining on the ion have a bind- 
ing energy greater than the energy of the incoming electrons. Thus the 
final charge state of the trapped ion is controlled by varying the accel- 
erating voltage on the electron gun. As an extreme example, consider 
stripping all but one of the electrons off a uranium atom. The final 
stage of the ionization process to produce Ut®! requires an energy of 
13.6 x (92)? ~ 10° eV, ie. an accelerating voltage of 100kV. These ex- 
treme conditions can be achieved but many EBIT experiments use more 
modest voltages on the electron gun of a few tens of kilovolts. 

The transitions between the energy levels of highly-charged ions pro- 
duce X-rays and the spectroscopic measurements of the wavelength of 
the radiation emitted from EBITs, use vacuum spectrographs (often 
with photographic film as the ‘detector’ since it has a high sensitivity at 
short wavelengths and gives good spatial resolution). Such traditional 
spectroscopic methods have lower precision than laser spectroscopy but 
QED effects scale up rapidly with increasing atomic number. The Lamb 
shift increases as (Za)* whereas the gross energy scales as (Za)*, so 
measurements of hydrogen-like ions with high Z allow the QED effects 
to be seen. It is important to test the QED calculations for bound states 
because, as mentioned in the previous section, they require distinctly dif- 
ferent approximations and theoretical techniques to those used for free 
particles. Indeed, as Z increases the parameter Za in the expansions 


used in the QED calculations becomes larger and higher orders start to 
make a greater contribution. 


12.9 Resolved sideband cooling 


Laser cooling on a strong transition of line width I rapidly reduces the 
energy of a trapped ion down to the Doppler cooling limit AI'/2 of that 
transition. ‘The laser cooling of an ion works in a very similar way to 
the Doppler cooling of a free atom (see Exercise 9.8). To go further 
narrower transitions are used. However, when the energy resolution of 
the narrow transition AI” is less than the energy interval hw, between the 
vibrational levels of the trapped ion (considered as a quantum harmonic 
oscillator) the quantisation of the motion must be considered, i.e. in the 
regime where 


i <<e |, (12.29) 


We have seen that typically [/27 is tens of MHz and w,/2m7 ~ 1 MHz. 
The vibrational energy levels have the same spacing Aw, in both the 
ground and excited states of the ion since the vibrational frequency 
depends only on the charge-to-mass ratio of the ion and not its internal 
state, as illustrated in Fig. 12.10. The trapped ion absorbs light at 
the angular frequency of the narrow transition for a free ion wo, and 
also at the frequencies wy = wo + wy, Wo + 2wy, Wo + 3uy, etc. that 
correspond to transitions in which the vibrational motion of the ion 
changes. The vibrational levels in the ground and excited states are 
labelled by the vibrational quantum numbers v and v’, respectively, 
and these sidebands correspond to transitions in which v' # v. The 
energy of the bound system is reduced by using laser light at frequency 
WE = Wo — Wy to excite the first sideband of lower frequency, so that 
the ion goes into the vibrational level with v’ = v — 1 in the upper 
electronic state. This excited state decays back to the ground state—the 
most probable spontaneous transition is the one in which the vibrational 
level does not change so that, on average, the ion returns to the ground 
electronic state in a lower vibrational level than it started. A detailed 
explanation of the change in v during spontaneous emission would need 
to consider the overlap of the wavefunctions for the different vibrational 
levels and is not given here.** 

This sideband cooling continues until the ion has been driven into the 
lowest vibrational energy level. An ion in the v = 0 level no longer 
absorbs radiation at wo — wy, as indicated in Fig. 12.10—experiments 
use this to verify that the ion has reached this level by scanning the 
laser frequency to observe the sidebands on either side, as shown in 
Fig. 12.10(b); there is little signal at the frequency of the lower sideband, 
as predicted, but there is a strong signal at the frequency of the upper 
sideband.** The lower sideband arises for the population in the v = 1 
level (and any higher levels if populated), and the upper sideband comes 
from ions in v = 0 that make a transition to the v’ = 1 level. Thus the 
ratio of the signals on these two sidebands gives a direct measure of the 
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33 The situation closely resembles that 
in the Franck—Condon principle that 
determines the change in vibrational 
levels in transitions between electronic 
states of molecules—the relevant po- 
tentials for an ion, shown in Fig. 12.10, 
are simpler than those for molecules. 


$474 is easy to be misled into think- 
ing that the number of sidebands ob- 
served depends on the occupation of 
the vibrational levels of the trapped 
ion (especially if you are familiar with 
the vibrational structure of electronic 
transitions in molecules). This exam- 
ple shows that this is not the case for 
ions, 1.e. sidebands arise for an ion that 
is predominantly in the lowest vibra- 
tional level. Conditions can also arise 
where there is very weak absorption at 
the frequencies of the sidebands, even 
though many vibrational levels are oc- 
cupied, e.g. for transitions whose wave- 
length is much larger than the region in 
which the ions are confined. Although 
sidebands have been explained here in 
terms of vibrational levels, they are not 
a quantum phenomenon—there is an 
alternative classical explanation of side- 
bands in terms of a classical dipole that 
emits radiation while it is vibrating to 
and fro. 
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Fig. 12.10 (a) The vibration levels of an ion in a harmonic trap have the same spacing for both the ground and excited 
electronic states. Excitation by light of frequency wr = wo — wy, followed by spontaneous emission, leads to a decrease in the 
vibrational quantum number, until the ion reaches the lowest level with v = 0, from whence there is no transition, as indicated 
by the dotted line at this frequency (but light at frequency wo and wo + wy would excite the ion in v = 0). (b) The spectrum 
of a trapped ion shows sidebands on either side of the main transition. The inset figure shows the spectrum before sideband 
cooling. The enlarged part of the figure shows that after sideband cooling the signal on the upper sideband Sy at angular 
frequency wo + wy is stronger than the lower one Sj, at wo — wy—this asymmetry indicates that the ion is mainly in the lowest 
vibrational state (i.e. vibrational quantum number much less than 1). The vertical axis gives the transition probability, or the 
probability of a quantum jump during the excitation of the narrow transition. After Diedrich et al. (1989). Part (b), copyright 
1989 by the American Physical Society. 


ratio of populations: in this example N (v = 1) /N (v = 0) = 0.05 so the 
ion spends most of its time in the lowest level. Thus the ion has almost 
the minimum energy attainable in this system. 

Single ion experiments such as optical frequency standards with ex- 
tremely high resolution (Section 12.6) do not need to cool the ion to the 
very lowest level—they just pick out the transition at wo from the well- 
resolved sidebands. The quantum computing experiments described in 
the next chapter, however, must have an initial state with all the ions in 
the lowest energy level of the trap (or very close to this ideal situation) 
to give complete control over the quantum state of the whole system. 
The preparation of all the trapped ions in the lowest vibrational level®° 
is complicated by the collective modes of vibration of a system with 
more than one trapped ion (Exercise 12.1), and the achievement of this 
initial state stretches the capability of laser cooling methods to their 
very limits.°° 


12.10 Summary of ion traps 


This chapter explored some of the diverse physics of ion trapping, rang- 
ing from the cooling of ions to temperatures of only 107? K in small ion 
traps to the production of highly-charged ions in the EBIT. Trapping 
of positrons was mentioned in Section 12.7.3 and ion traps make good 
containers for storing other types of antimatter such as antiprotons pro- 
duced at particle accelerators.*” In recent experiments at CERN carried 
out by a large collaboration (Amoretti et al. 2002) these two antiparti- 
cles have been put together to produce anti-hydrogen. In the future it 
will be possible to do anti-atomic physics, e.g. to measure whether hydro- 
gen and anti-hydrogen have the same spectra (a test of CPT invariance.) 
This high-energy trapping work has developed from accelerator-based 
experiments and probes similar physics. 

At the opposite pole lies the work on the laser cooling of ions to ex- 
tremely low energies. We have seen that the fundamental limit to the 
cooling of a bound system is quite different to the laser cooling of free 
atoms. Experimenters have developed powerful techniques to manip- 
ulate single ions and make frequency standards of extreme precision. 
The long decoherence times of trapped ions are now being exploited to 
carry out the manipulation of several trapped ions in experiments on 
quantum computation, which is the subject of the next chapter. Such 
experimental techniques give exquisite control over the state of the whole 
quantum system in a way that the founders of quantum mechanics could 
only dream about. 


Further reading 
The book on ion trapping by Ghosh (1995) gives a detailed account of 


these techniques. See also the tutorial articles by Wayne Itano (Itano 
et al. 1995) and David Wineland (Wineland et al. 1995), and the Nobel 
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35For neutral particles in magnetic 
traps, quantum statistics causes the 
atoms to undergo Bose-Einstein con- 
densation into the ground state, even 
though they have a mean thermal en- 
ergy greater than the spacing between 
trap energy levels. Quantum statistics 
does not affect trapped ions because 
they are distinguishable—even if the 
ions are identical the mutual Coulomb 
repulsion keeps them far apart, as 
shown in Fig. 12.4, and the strong flu- 
orescence enables the position of each 
ion to be determined. 


36 The alert reader may have noticed 
that we have not discussed the recoil 
limit, that plays such an important role 
for free particles. For stiff traps the 
spacing of the vibrational energy lev- 
els greatly exceeds the recoil energy 
hwy >> Evec, and the cooling limit of 
the trapped particle is determined by 
the zero-point energy. 


37 Just after its creation in high-energy 
collisions, the antimatter has an energy 
of MeV but it is moderated to energies 
of keV before trapping. 
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380n the web site of the Nobel prizes. 


prize lecture of Wolfgang Paul.*® The National Physical Laboratory in 


the UK and the National Institute of Standards and Technology in the 
US provide internet resources on the latest developments and research. 


Exercises 


(12.1) The vibrational modes of trapped ions 
Two calcium ions in a linear Paul trap lie in a line 
along the z-axis. 


(a) 


(b) 


(e) 


The two end-cap electrodes along the z-axis 
produce a d.c. potential as in eqn 12.23, with 
a2 = 10° Vm™?. Calculate w.. 


The displacements z; and z2 of the two ions 
from the trap centre obey the equations 


Vee oars een 
are z ? 
(22 - 21)” 
2 
Ss 4 
Mz = —Mw?z2 + BEER 
(22 — 21) 


Justify the form of these equations and show 
that the centre of mass, zm = (21 + 22)/2 
oscillates at w,. 

Calculate the equilibrium separation a of two 
singly-charged ions. 

Find the frequency of small oscillations of the 
relative position z = z2 — 21 —a. 


Describe qualitatively the vibrational modes 
of three ions in the trap, and the relative or- 
der of their three eigenfrequencies.°? 


(12.2) Paul trap 


(a) 


(b) 


(c) 


For Hgt ions in a linear Paul trap with di- 
mensions ro = 3mm, calculate the maximum 
amplitude Vinax of the radio-frequency voltage 
at Q = 27 x 10 MHz. 


For a trap operating at a voltage Vo = 
Vee V2, calculate the oscillation frequency 
of an Hg* ion. What happens to a Ca‘ ion 
when the electrodes have the same a.c. volt- 
age? 

Estimate the depth of a Paul trap that has 
Vo = Vinax/W2, expressing your answer as a 
fraction of eVo. 


(d) Explain why a Paul trap works for both posi- 


tive and negative ions. 


(12.3) Investigation of the Mathieu equation 


(12.4) 


Numerically solve the Mathieu equation and plot 
the solutions for some values of gz. Give exam- 
ples of stable and unstable solutions. By trial and 
error, find the maximum value of g, that gives a 
stable solution, to a precision of two significant 
figures. Explain the difference between precision 
and accuracy. [Hint. Use a computer package for 
solving differential equations. The method in Ex- 
ercise 4.10 does not work well when the solution 
has many oscillations because its numerical inte- 
gration algorithm is too simple.] 


The frequencies in a Penning trap 

A Penning trap confines ions along the axis by 
repulsion from the two end-cap electrodes; these 
have a d.c. positive voltage for positive ions that 
gives an axial oscillation frequency, as calculated 
in Exercise 12.1. This exercise looks at the radial 
motion in the z = 0 plane. The electrostatic po- 
tential in eqn 12.23 with az = 10° Vm~? leads to 
an electric field that points radially outwards, but 
the ion does not fly off in this direction because 
of a magnetic field of induction B = 1T along the 


Z-axis. 
Consider a Cat ion. 


(a) Calculate the cyclotron frequency. 


(b) Find the magnetron frequency. [Hint. Work 


out the period of an orbit of radius r in a 
plane perpendicular to the z-axis by assuming 
a mean tangential velocity v = E(r)/B, where 
E(r) is the radial component of the electric 
field at r.] 


3°They resemble the vibrations of a linear molecule such as COz, described in Appendix A; however, a quantitative treatment 
would have to take account of the Coulomb repulsion between all pairs of ions (not just nearest neighbours). 


(12.5) Production of highly-charged ions in an EBIT 


(a) 


(b) 
(c) 


(d) 


Estimate the accelerating voltage required for 
an electron beam voltage that produces hy- 
drogenic silicon Si!** in an EBIT. 


Calculate the radius of the first Bohr orbit 
(n = 1) in hydrogenic uranium, UT?’. 
Calculate the binding energy of the electron in 


U**! and express it as a fraction of the atom’s 
rest mass energy Mc’. 


QED effects contribute 3 x 107~°eV to the 
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binding energy of the 1s ground configuration 
of atomic hydrogen. Express this as a frac- 
tion of the hydrogen atom’s rest mass. Esti- 
mate the magnitude of the QED contribution 
in the ground state of hydrogen-like uranium 
Ut as a fraction of the rest mass. This frac- 
tion gives the precision AM/M with which 
the ion’s mass must be determined in order to 
measure QED effects. Discuss the feasibility 
of doing this in an ion trap. 
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' Quantum computing requires precise 
control of the motion of the ions, i.e. 
their external degrees of freedom, as 
well as their internal state |F, Mp). 


Quantum computing 


Quantum computing will be a revolutionary new form of computation in 
the twenty-first century, able to solve problems inaccessible to classical 
computers. However, building a quantum computer is very difficult, and 
so far only simple logic gates have been demonstrated in experiments on 
ions in a linear Paul trap. The ideas of quantum computing have also 
been tested in experiments using nuclear magnetic resonance (NMR). 

In the Paul trap, the ions sit in an ultra-high vacuum (pressure ~ 
10-1! mbar) so that collisions rarely happen, and the ions are well iso- 
lated from the environment. We have seen that these conditions enable 
extremely high-resolution spectroscopy of single ions because of the very 
small perturbations to the energy levels. Quantum computing requires 
more than one ion in the trap, and all of these ions must be cooled to the 
lowest vibrational level to give a well-defined initial quantum state for 
the system.' This presents a much greater experimental challenge than 
the laser cooling of a single ion to reduce Doppler broadening, but it has 
been achieved in some experiments. The previous chapter describes the 
physics of the linear Paul trap but for the purposes of this chapter we 
simply assume that the trap produces a harmonic potential with strong 
confinement in the radial direction, so that the ions lie in a line along its 
z-axis, as shown in Fig. 13.1; their mutual electrostatic repulsion keeps 
the ions far enough apart for them to be seen separately. 


Fig. 13.1 A string of four ions in a linear Paul trap. Coulomb repulsion keeps the 
ions apart and the gap corresponds to an ion in the dark state. In this experiment 
two laser beams simultaneously excite strong and weak transitions to give quantum 
jumps, as described in Section 12.6, so that each ion flashes on and off randomly. This 
snapshot of the system at a particular time could be taken to represent the binary 
number 1101. A quantum logic gate requires much more sophisticated techniques in 
which laser pulses determine precisely the initial state of each individual ion in the 
chain, as described in this chapter. Courtesy of Professors A. M. Steane and D. N. 
Stacey, D. M. Lucas and co-workers, Physics department, University of Oxford. 


13.1 Qubits and their properties 


A classical computer uses bits with two values 0 or 1 to represent binary 
numbers, but a quantum computer stores information as quantum bits 
or qubits (pronounced Q-bits). Each qubit has two states, labelled |0) 
and |1) in the Dirac ket notion for quantum states. Most theoretical 
discussions of quantum computing consider the qubit as a spin-1/2 ob- 
ject, so the two states correspond to spin down |m, = —1/2) and spin up 
[ms = +1/2). However, for a trapped ion the two states usually corre- 
spond to two hyperfine levels of the ground configuration, as illustrated 
in Fig. 13.2. In the following discussion, |1) represents the ion in the 
upper hyperfine level and |0) is used for the lower hyperfine level, but 
all of the arguments apply equally well to spin-1/2 particles since the 
principles of quantum computing clearly do not depend on the things 
used as qubits. Ions, and other physical qubits, give a compact way of 
storing information, e.g. |1101) represents the binary number 1101 in 
Fig. 13.1, but the quantum features of this new way of encoding infor- 
mation only become apparent when we consider the properties of more 
than one qubit in Section 13.1.1. Even though a single qubit gener- 
ally exists in a superposition of the two states, a qubit does not carry 
more classical information than a classical bit, as shown by the following 
argument. The superposition of the two states 


Pqubit = @|0) + b[1) (13.1) 


obeys the normalisation condition |a|? + |b]? = 1. We write this super- 
position in the general form 


Yqudit = {co (5) |0) +e! sin (5) m)} ar: (13.2) 


The overall phase factor has little significance and the possible states cor- 
respond to vectors of unit length with direction specified by two angles 
and ¢. These are the position vectors of points lying on the surface of a 
sphere, as in Fig. 13.3. The state |Q) lies at the north pole of this Bloch 
sphere and |1) at the south pole; all other position vectors are superpo- 
sitions of these two basis states. Since these position vectors correspond 
to a simple classical object such as a pointer in three-dimensional space, 
it follows that the information encoded by each qubit can be modelled 
in a classical way. 

The analogy between a qubit and a three-dimensional pointer seems to 
imply that a qubit stores more information than a bit with two possible 
values 0 or 1, e.g. like one of the hands of an analogue clock that gives us 
information about the time by its orientation in two-dimensional space. 
Generally, however, this is not true because we cannot determine the ori- 
entation of a quantum object as precisely as the hands on a clock. Mea- 
surements can only distinguish quantum states, with a high probability, 
if the states are very different from each other, occupying well-separated 
positions on opposite sides of the Bloch sphere. A measurement on a 
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N=|r +i 


0) = |F, M) 


Fig. 13.2 The two hyperfine levels in 
the ground state of an ion. Gener- 
ally, experiments use an ion with to- 
tal angular momentum J = 1/2 in the 
lowest electronic configuration, so that 
F = 1I+1/2. The qubits |0) and [1) 
correspond to two particular Zeeman 
states in the two levels, e.g. |F, M) and 
|F + 1, M’), respectively. 
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Fig. 13.3 The state |0) lies at the 
north pole of this Bloch sphere and 
|1) at the south pole; all other posi- 
tion vectors are superpositions of these 
two basis states. The Bloch sphere 
lies in the Hilbert space spanned by 
the two eigenvectors |0) and |1). The 
Hadamard transformation defined in 
eqn 13.3 takes |0) ++ {0) + |1) (cf. 
Des, 2). 


2In this chapter wavefunctions are writ- 
ten without normalisation, which is the 
common convention in quantum com- 
putation. 


spin-1/2 particle determines whether its orientation is up or down along 
a given axis. After that measurement the particle will be in one of those 
states since the act of measurement puts the system into an eigenstate 
of the corresponding operator. In the same way, a qubit will give either 
0 or 1, and the read-out process destroys the superposition. 

The Bloch sphere is very useful for describing how individual qubits 
transform under unitary operations. For example, the Hadamard trans- 
formation that occurs frequently in quantum computation (see the ex- 
ercises at the end of this chapter) has the operator? 


Oi |0) ++ |0) + |1), 
Uy |1) ++ |0) — |1). 


This is equivalent to the matrix 


a cel ae 
te) 4) 


The effect of this unitary transformation of the state is illustrated in 
Fig. 13.3—it corresponds to a rotation in the Hilbert space containing 
the state vectors. This transformation changes |0), at the north pole, 
into the superposition given in eqn 13.3 that lies on the equator of the 
sphere. 


(13.3) 


13.1.1 Entanglement 


We have already encountered some aspects of the non-intuitive behaviour 
of multi-particle quantum systems in the detailed description of the two 
electrons in the helium atom (Chapter 3), where the antisymmetric spin 
state [ |{t) — |N) ]/W2 corresponds to the wavefunction 


W = |01) — |10) (13.4) 


in the notation used in this chapter (without normalisation). This does 
not factorise into a product of single-particle wavefunctions: 


VA vite, (13.5) 


where 
dive = [al0) +b|1)], [el0) +d1)], . (13.6) 


Here c and d are additional arbitrary constants, and the futility of at- 
tempting to determine these constants quickly becomes obvious if you 
try it. Generally, we do not bother with the subscript used to denote the 
particle, so |0),|1). = |01) and |1),|1), = |11), etc. Multiple-particle sys- 
tems that have wavefunctions such as eqn 13.4 that cannot be written as 
a product of single-particle wavefunctions are said to be entangled. This 
entanglement in systems with two, or more, particles leads to quantum 
properties of a completely different nature to those of a system of clas- 
sical objects—this difference is a crucial factor in quantum computing. 
Quantum computation uses qubits that are distinguishable, e.g. ions at 
well-localised positions along the axis of a linear Paul trap. We can label 
the two ions as Qubit 1 and Qubit 2 and know which one is which at any 
time. Even if they are identical, the ions remain distinguishable because 
they stay localised at certain positions in the trap. For a system of dis- 
tinguishable quantum particles, any combination of the single-particle 
states is allowed in the wavefunction of the whole system: 


W = Al00) + BIO1) + C]10) + Dj11). (13.7) 


The complex amplitudes A, B, C and D have arbitrary values. It is 
convenient to write down wavefunctions without normalisation, e.g. 


W = |00) + JO1) + |10) + |11), (13.8) 
& = |00) + 2/01) + 3/11), (13.9) 
W = (01) + 5/10). (13.10) 


Two of these three wavefunctions possess entanglement (see Exercise 
13.1). We encounter examples with three qubits later (eqn 13.12). 

In the discussion so far, entanglement appears as a mathematical prop- 
erty of multiple-particle wavefunctions, but what does it mean physi- 
cally? It is always dangerous to ask such questions in quantum me- 
chanics, but the following discussion shows how entanglement relates 
to correlations between the particles (qubits), thus emphasising that 
entanglement is a property of the system as a whole and not the indi- 
vidual particles. As a specific example consider two trapped ions. To 
measure their state, laser light excites a transition from state |1) (the 
upper hyperfine level) to a higher electronic level to give a strong flu- 
orescence signal, so |1) is a ‘bright state’, while an ion in |0) remains 
dark.? Wavefunctions such as those in eqns 13.4 and 13.10 that contain 
only the terms |10) and |01) always give one bright ion and the other 
dark, i.e. an anticorrelation where a measurement always finds the ions 
in different states. To be more precise, this corresponds to the following 
procedure. First, prepare two ions so that the system has a certain ini- 
tial wavefunction W;,, then make a measurement of the state of the ions 
by observing their fluorescence. Then reset the system to Wj, before 
another measurement. The record of the state of the ions for a sequence 
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3This is similar to the detection of 
quantum jumps in Section 12.6, but 
typically quantum computing experi- 
ments use a separate laser beam for 
each ion to detect them independently. 
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4 Bell considered the well-known EPR 
paradox and the system of two ions de- 
scribed above has the same properties 
as the two spin-1/2 particles usually 
used in quantum mechanics texts (Rae 
1992). 


of such measurements looks like: 10, 10, 01, 10, 10,.... Each ion 
gives a random sequence of 0s and 1s but always has the opposite state 
to the other ion. 

This example does not illustrate the full subtlety of entanglement be- 
cause we would get the same result if we prepared the two ions either 
in |01) or |10) randomly at the beginning. Such an apparatus produces 
correlated pairs of ions in a purely classical way that mimics the quan- 
tum situation. John Bell proved that we can make measurements that 
distinguish the ‘classical correlation’ from an entangled state. The above 
description shows that this cannot be done simply by making measure- 
ments along the axes defined by the basis states |0) and |1), but it turns 
out that quantum entanglement and ‘classically-correlated’ particles give 
different results for measurements along other sets of axes. This was a 
very profound new insight into the nature of quantum mechanics that 
has stimulated much important theoretical and experimental work.* 

‘Entanglement implies correlation but correlation does not imply en- 
tanglement’. In the following we shall concentrate mainly on the first 
half of this statement, i.e. two-particle systems encode quantum infor- 
mation as a joint property of the qubits and carry more information than 
can be stored on the component parts separately. The quantum infor- 
mation in an entangled state is very delicate and is easily destroyed by 
perturbations of the relative phase and amplitude of the qubits, e.g. in 
present-day ion traps it is difficult to maintain coherence between more 
than a few qubits; this decoherence is caused by random perturbations 
that affect each qubit in a different way. 

The wavefunctions of the two electrons in helium are entangled but 
they do not give qubits useful for quantum computing. Nevertheless, 
since this is a book about atomic physics it is worthwhile to look back 
at helium. The antisymmetric state of the two spins has already been 
used as an example of entanglement. The symmetric spin wavefunction 
[ it) + IN) 1/2 also has entanglement, but the two other symmetric 
wavefunctions factorise: |tt) = |t),|t)., and similarly for |||). The two 
electrons are in these eigenstates of S because of the exchange symme- 
try. When the two electrons do not have the same quantum numbers, n 
and /, the spatial wavefunctions are symmetric and antisymmetric com- 
binations of the single-electron wavefunctions and are entangled. These 
eigenstates of the residual electrostatic interaction also satisfy the re- 
quirement of exchange symmetry for identical particles. (Note that the 
energy levels and spatial wavefunctions would be the same even if the 
particles were not identical—see Exercise 3.4.) The exchange integrals 
in helium can be regarded as a manifestation of the entanglement of the 
spatial wavefunction of the two electrons that leads to a correlation in 
their positions, or an anticorrelation, making it more (or less) probable 
that the electrons will be found close together. From the quantum per- 
spective, the energy difference for two different entangled wavefunctions 
does not seem strange because we do not expect them to have the same 
properties, even if they are made up of the same single-electron states. 


13.2 A quantum logic gate 


Quantum computing uses nothing more than standard quantum mechan- 
ics but it combines operations on the qubits and quantum measurements 
in sophisticated ways to give amazingly powerful new methods for com- 
putation. Here we consider just one example of the transformations of 
the qubits, or quantum logic gates, that form the elementary building 
blocks (primitives) of a quantum computer. The controlled-NOT gate 
(CNOT) transforms two qubits, as shown in Table 13.1. 

The CNOT gate changes the value of the second qubit if and only 
if the first qubit has the value 1. The first qubit controls the effect 
of the gate on the second qubit. The truth Table 13.1 looks the same 
as that for ordinary logic, but a quantum logic gate corresponds to 
an operation that preserves the superposition of the input states. The 
quantum mechanical operator of the CNOT gate Ucnor acts on the 
wavefunction of the two qubits to give 


Ucnor {A |00) + B|01) + C10) + D|11)} 


(13.11) 
~ A|00) + B01) + C}11) + D/]10). 


Alternatively, the effect of this operation can be written as |10) © |11), 
with the other states unchanged. The complex numbers A, B, C and D 
represent the phase and relative amplitude of states within the super- 
position. 


13.2.1 Making a CNOT gate 


The CNOT gate is elementary but it turns out not to be the simplest 
gate to make in an ion trap quantum processor—it requires a sequence 
of several operations, as described in Steane (1997). So, despite the 
fact that quantum computing has been introduced into this book as an 
application of ion trapping, this section explains how to make a quan- 
tum logic gate for two spin-1/2 particles. This two-qubit system is not 
just a convenient theoretical case, but corresponds to actual quantum 
computing experiments carried out with the nuclear magnetic resonance 
(NMR) technique. Figure 13.4 shows the energy levels of the two inter- 
acting spins. 

We do not need to go into all the details of how this energy-level struc- 
ture arises to understand how to make a CNOT gate, but it is important 


Table 13.1 Truth table of the CNOT quantum logic gate. 


100) —> |00) 
101) —> |01) 
110) -> |11) 
111) + |10) 
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Fig. 13.4 (a) The energy levels of two spin-1/2 particles which do not interact with each other. The frequencies that drive 
the transition between the levels of Qubit 1 and Qubit 2 are w; and we, respectively. In NMR these levels correspond to the up 
and down orientations of two protons in a strong magnetic field, i.e. the states |m; = +1/2) for each proton. The difference in 
the resonance frequencies w; # w2 arises from interaction with their surroundings (other nearby atoms in the molecule). (b) 
The energy levels of two interacting particles, or qubits. The interaction between the qubits makes the frequency required to 
change the orientation of Qubit 2 depend on the state of the other qubit (and similarly for Qubit 1). To denote this, the two 
new resonance frequencies close to w2 are labelled with the superscripts |x) and |1z). (Note this rather cumbersome notation 
is not generally used but it is useful in this introductory example.) The absorption spectrum is drawn below the corresponding 
transitions. A a-pulse of radio-frequency radiation at angular frequency wi!) switches Qubit 2 (|0). © |1).) ¢f and only if 
Qubit 1 is in |1),. This gives the CNOT gate of Table 13.1. For details of the NMR technique see Atkins (1994). In NMR the 
interaction of the nuclear magnetic moment with a strong external magnetic field g;431-B dominates. For a field of 10 T the 
protons at the centre of the hydrogen atoms in the sample have resonance frequencies of 400 MHz. This frequency corresponds 
to approximately w1/2m ~ w2/2m in the figure. The chemical shift causes w; and w2 to differ by only a few parts per million, 
but this is resolved by standard NMR equipment. Atkins (1994) describes how the fine structure in NMR spectra, shown in 
(b), arises from spin-spin coupling between the two nuclear spins. 


5The amplitudes and phases of the to know that spontaneous emission is negligible. To make our discussion 
states within the superposition must Jess abstract, it is also useful to know that in NMR experiments the 
We ae a energy differences between the levels shown in Fig. 13.4 arise from the 
time-scale of the experiment. orientation of the magnetic moment of the nuclei, proportional to their 
spin, in a strong magnetic field and that radio-frequency radiation drives 
transitions between the states. There is a splitting of hw, between the 
up and down states of the first qubit (|0) and |1)) and hw. for the 
second qubit. Therefore a pulse of radio-frequency radiation at angular 
frequency w; changes the orientation of the first spin (Qubit 1); for 
example, a 7-pulse (as defined in Section 7.3.1) swaps the states of this 
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qubit |0) < |1), i.e. aj0)+5]1) + al1)+0)0). Similarly, pulses at angular 
frequency w2 manipulate the state of the other spin (Qubit 2). These 
changes in the state of each qubit independently are not sufficient for 
quantum computing. The qubits must interact with each other so that 
one qubit ‘controls’ the other qubit and influences its behaviour. A 
small interaction between the spin causes the shift of the energy levels 
indicated in going from Fig. 13.4(a) to (b)—the energy required to flip 
Qubit 2 now depends on the state of Qubit 1 (and vice versa). The energy 
levels in Fig. 13.4(b) give four separate transition frequencies between 
the four states of the two qubits; a pulse of radio-frequency radiation 
that drives one of these transitions can achieve the CNOT operation—it 
changes the state of Qubit 2 only if Qubit 1 has state |1).® 

Although it seems simple for the spin-1/2 system, the implementa- 
tion of a quantum logic gate represents a major step towards building 
a quantum computer. Rigorous proofs based on mathematical proper- 
ties of unitary operators in quantum mechanics show that any unitary 
operation can be constructed from a few basic operators—it is sufficient 
to have just one gate which gives control of one qubit by another, such 
as the CNOT, in addition to the ability to manipulate the individual 
qubits in an arbitrary way.’ These operators form a so-called universal 
set which generates all other unitary transformations of the qubits. 


13.3 Parallelism in quantum computing 


A classical computer acts on binary numbers stored in the input regis- 
ter, or registers, to output another number also represented as bits with 
values 0 and 1. A quantum computer acts on the whole superposition of 
all the input information in the qubits of its input register, e.g. a string 
of ions in a linear Paul trap. This quantum register can be prepared 
in a superposition of all possible inputs at the same time, so that the 
quantum computing procedure transforms the entire register into a su- 
perposition of all possible outputs. For example, with N = 3 qubits a 
general initial state containing all possible inputs has the wavefunction 


W = A|000) + B\001) + C|010) + D|011) 


13512 
+ E|100) + F|101) + G|110) + H|111). ee 


Quantum computing corresponds to carrying out a transformation, rep- 
resented by the quantum mechanical operator U, to give the wavefunc- 
tion ©’ = UW that is a superposition of the outputs corresponding to 
each input 


Ww = AU |000) + BU |001) + CU |010) +... (13.13) 


Useful quantum algorithms combine different Choe such as 
etean to give an overall transformation U = om ae and these 
operations have the same advantage of parallelism as for the individ- 
ual operations. It appears that, instead of laboriously calculating the 
output for each of the binary numbers 000 to 111, corresponding to 0 


6 The coupling between trapped ions 
arises from the mutual electrostatic re- 
pulsion of the ions, but this interaction 
between qubits does not give a level 
structure like that in Fig. 13.4. 


7To rotate the nuclear spins to an ar- 
bitrary point on the Bloch sphere the 
NMR experiments use radio-frequency 
pulses. Experiments in ion traps use 
Raman pulses (Section 9.8). 


290 Quantum computing 


8 The need for safe methods of encrypt- 
ing information has always been cru- 
cial for the military, and nowadays it 
is important for confidential electronic 
transfer of data in business and to pro- 
tect credit card numbers. 


to 7 in decimals, the quantum computation does all eight calculations 
simultaneously. Herein lies the great power of quantum computing. 

The number of combinations of the input states increases exponen- 
tially with the number of qubits; for a register of 100 qubits this means 
a transformation of 21° = 10°° inputs in parallel—an astronomical num- 
ber. But how can the correct answer be picked out from the enormous 
number of possible outputs? All the possibilities encoded in the multiple- 
particle wavefunction V’ cannot be turned into a list of the outputs for 
a given input, as given by a classical computer. Indeed, a simple quan- 
tum measurement just gives one of the outputs chosen randomly from 
all the possibilities. However, quantum mechanics allows measurements 
that give some information about all of the outputs, if we renounce the 
possibility of getting information about any particular output—just as 
the complementarity principle stops us knowing both the position and 
momentum of a particle precisely. As a trivial example that gives the 
flavour of this idea, but is in fact not strictly correct, consider the mea- 
surement of the state of the last qubit in the output register—if all the 
possible outputs are even numbers then this ion will be in |0). So we 
can establish something about all the possibilities in a single quantum 
computation, but this does not allow any more information to be ex- 
tracted, i.e. if they are not all even we cannot find out which ones are 
odd (nor which input states gave these outputs). Similar ideas apply for 
more complicated quantum algorithms, e.g. finding the prime factors of 
numbers. (The real nature of quantum computation can only properly 
be appreciated by working through a quantum example, so the grossly 
simplified case given here must not be pushed too far.) 

The factorisation of large numbers is often quoted as a ‘killer appli- 
cation’ of quantum computing, i.e. something that they can do which 
cannot be done on existing computers (within a realistic time). It is ob- 
viously easy to multiply two prime numbers together, e.g. 37 x61 = 2257; 
but it is much more difficult to go in the opposite direction, e.g. to find 
the prime factors of 1271 (try it for yourself). For the larger numbers a 
classical computer can easily do the multiplication, but the factorisation 
of a number of order 10!°° would take an exceedingly long time even with 
the fastest supercomputers and the most efficient classical algorithm to 
search for the prime factors. It has been proved rigorously that quantum 
algorithms exist to attack this type of problem—the algorithms act like 
a filter only letting through the sought-after combination of qubits from 
the input superposition state. The practical impossibility of factorising 
large numbers on present-day computers forms the foundation of the 
best methods of encryption.? A quantum computer has the ability to 
try many combinations in each computation and so quickly find the key. 
The possibility of cracking the best currently-used codes has prompted 
such hard-headed bodies as the government security agencies to investi- 
gate quantum computing and fund research on ion traps. One wonders 
what they think of the intrigue and mystery of quantum mechanics. 


| 
| 
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13.4 Summary of quantum computers 


Qubits are quantum objects that store information and can exist in 
arbitrary superpositions a|0) + b|1). A quantum computer is a set of 
qubits on which the following operations can be carried out. 


(1) Each qubit can be prepared in a given state, so the quantum registers 
of the computer have a well-defined initial state. 


(2) In quantum computation the quantum logic gates (unitary transfor- 
mations) act on a chosen subset of the qubits. During these processes 
the system is in an entangled state where the information is encoded 
in the state of the entire quantum register. This cannot be reduced 
to a description of terms of the individual qubits, like a list of Os 
and Is for a classical computer. Some of these operations are control 
operations in which the change in the state of a qubit depends on 
the state of other qubits. 


(3) The final state of the qubits is read out by making a quantum mea- 
surement. 


For quantum information processing with a string of trapped ions, the 
three stages of initialisation, quantum logic and read out correspond to 
the following operations: 


(1) Preparation of the initial state—all of the ions must be cooled to 
the ground vibrational state and be in the same internal state. Any 
state |F', Mr) of a given hyperfine level will do and the choice is 
based on practical considerations.° 


(2) Raman transitions change both the internal and the vibrational 
states of the ions (qubits) to implement the operation of quantum 
gates. 


(3) Laser beams resonant with a strong transition determine which hy- 
perfine level each ion is in at the end of the process (Section 12.6). 
The ions lie at least 10 wm apart, so that each one is seen individually 
(agen Fig? 13.1). 


Quantum computing needs only a few basic types of quantum logic gate. 
A control gate between a single pair of qubits in a multiple-particle 
system can be combined with swap operations to effectively extend the 
operation to all pairs of qubits. These manipulations combined with 
arbitrary rotations of individual qubits give a universal set of operators 
from which all other unitary operators can be constructed. 


13.5 Decoherence and quantum error 
correction 


The power of quantum computing to solve important problems has 
been proved mathematically, but so far experiments have only used 


9tn the NMR experiments the prepara- 
tion of the initial state, or the resetting 
of the device after a computation, are 
not straightforward since those systems 
do not have a dissipative process equiv- 
alent to laser cooling. 
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10 at that time NMR experiments used 
up to seven qubits. 


ORC manages to make measure- 
ments on eigenstates that are in some 
sense orthogonal to those in the origi- 
nal superposition. ‘These special quan- 
tum measurements cause the small ad- 
mixture of other states introduced into 
the wavefunction by the noise to col- 
lapse to zero. The QEC measurements 
are performed on extra qubits entan- 
gled with the qubits in the quantum 
register; measurements made on these 
ancillary qubits do not reduce the size 
of the superposition state that stores 
quantum information (in the register), 
i.e. the number of qubits in the su- 
perposition is preserved in QEC. The 
technical details of this process are ex- 
plained in Steane (1998). Some elemen- 
tary aspects of QEC can be appreciated 
by analogy with the quantum Zeno ef- 
fect described in Exercise 13.5. 


a few qubits to demonstrate the principle of some elementary opera- 
tions. These experiments all use state-of-the-art techniques, but for 
large systems the external perturbations blur the entangled states so 
that they cannot be distinguished from each other, e.g. for ions in the 
Paul trap the electrical noise on the radio-frequency electrodes causes 
random changes of the phase of the states within a superposition. In the 
year 2000 the best ion trap experiment was limited to four qubits.1° De- 
coherence quickly causes a system of many qubits to become so muddled 
up that it is impossible to pick out the required output. In contrast, in a 
classical computer errors which change 0 into 1, or the other way round, 
occur extremely rarely, and present-day computers have procedures for 
correcting errors so that they have a negligible effect. 

Decoherence was thought to be an incurable disease in real quan- 
tum systems that would prevent quantum information processing from — 
ever being carried out with enough qubits to make it really useful. How- 
ever, & new cunning way of encoding information has been invented that 
cures the symptoms of decoherence. So-called quantum error correction 
(QEC) exploits the subtle features of the theory of quantum mechanics 
to get rid of the small amounts of the unwanted states that gradually 
get mixed into the states containing the quantum information by per- 
turbations on the system. Broadly speaking, making a certain type of 
quantum measurement causes the wavefunction of the system of qubits 
to ‘collapse’, in a way that destroys any additional phase or amplitude in- 
troduced by decoherence. This measurement must be made with respect 
to a very carefully chosen basis of eigenstates to preserve the entangle- 
ment of the wavefunction. Clearly, it is not adequate to make a simple 
measurement that causes quantum superpositions to collapse into just 
one of the states within the superposition, since this would completely 
destroy the coherence between the states that we want to preserve. 

In the context of ion traps, the recent advances in quantum com- 
puting and QEC can be regarded as the latest steps in the evolution 
of spectroscopy and laser cooling, as illustrated by the following rough 
history. 


(1) The first spectroscopists observed light from discharge lamps, e.g. 
the Balmer lines in atomic hydrogen. They used spectrographs and 
étalons and the resolution was limited by Doppler broadening, to- 
gether with collisions and other broadening mechanisms in the dis- 
charge. 


(2) Atomic beams allowed experimenters to change the population be- 
tween the various energy levels of the atoms, using radio-frequency 
or microwave radiation to manipulate hyperfine and Zeeman levels 
in the ground state. (Optical pumping was applied to the ground 
states of certain atoms.) Atoms were deflected slightly in the Stern— 
Gerlach experiment but without a significant change in speed. The 
laser extended these techniques to the higher levels using optical 
transitions, so that, in principle, atomic physics experiments can 
manipulate the internal states of the atoms and put atoms into any 


desired energy level. 
(3) Ion traps confined the ions for long periods with buffer gas cooling. 


(4) The advent of laser cooling of atoms meant that spectroscopists were 
no longer passive observers, but could magnetically trap neutral 
atoms and control the atom’s motion. Laser cooling of ions can put 
them into the ground state of the trap to give a completely defined 
external state. 


(5) Quantum computing techniques allow the manipulation of both the 
external and internal state of the trapped ion qubits with laser light, 
to put a system of many ions into a particular quantum state. 


(6) Quantum error correction can be regarded as a refined form of laser 
cooling. QEC puts the ions back into a coherent superposition of 
the desired states of the system (actually a particular sub-space of 
Hilbert space), rather than the lowest energy state as in conven- 
tional cooling. It was a real surprise that decoherence in quantum 
mechanics can be overcome in this way and actively stabilise quan- 
tum information. 


13.6 Conclusion 


At heart, quantum mechanics remains mysterious. Certain aspects of 
quantum behaviour seem strange to our intuition based on the classi- 
cal world that we experience directly, e.g. the well-known examples of 
‘which way the photon goes’ in a Young’s double-slit experiment and 
Schrédinger’s cat paradox.'? These examples have provoked much dis- 
cussion and thought over the years about the so-called quantum mea- 
surement problem. Nowadays, physicists do not regard the peculiarities 
of quantum systems as a problem at all, but rather as an opportunity. 
A proper appreciation of the profoundly different properties of multiple- 
particle systems and the nature of entanglement has shown how to use 
their unique behaviour in quantum computing. Just as a car mechanic 
uses her practical knowledge to get an engine working, without worry- 
ing about the details of internal combustion, so a quantum mechanic 
(or quantum state engineer) can design a quantum computer using the 
known rules of quantum mechanics without worrying too much about 
philosophical implications. Physicists, however, strive towards a bet- 
ter understanding of the quantum world and consideration of the pro- 
found and subtle ideas in quantum information theory sheds new light 
on aspects of quantum mechanics. In physics research there often exists 
a symbiotic relationship between theory and experimental work, with 
each stimulating the other; a prime example is quantum error correc- 
tion which was not thought about until people started to face up to 
the fact that none of the systems used in existing experiments has any 
possibility of working reliably with a useful number of qubits (without 
error correction)—of course, the problem of decoherence has always been 
recognised but experimental results focused attention on this issue. 


13.6 Conclusion 293 


12 See quotation from Heisenberg in the 
background reading section of the Pref- 
ace. 
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The theoretical principles of quantum computing are well understood 
but there are many practical difficulties to overcome before it becomes 
a reality. All potential systems must balance the need to have interac- 
tions between the qubits to give coherent control and the minimisation 
of the interactions with the external environment that perturb the sys- 
tem. Trapped ions have decoherence times much longer than the time 
needed to execute quantum logic gates and this makes them one of the 
most promising possibilities. In quantum computing many new and in- 
teresting multiple-particle systems have been analysed and, even if they 
cannot be realised yet, thinking about them sharpens our understand- 
ing of the quantum world, just as the EPR paradox did for many years 
before it could be tested experimentally. 


Further Reading 


The Contemporary Physics article by Cummins and Jones (2000) gives 
an introduction to the main ideas and their implementation by NMR 
techniques. The books by Nielsen and Chuang (2000) and Stolze and 
Suter (2004) give a very comprehensive treatment. The article on the 
ion-trap quantum information processor by Steane (1997) is also useful 
background for Chapter 12. Quantum computing is a fast-moving field 
with new possibilities emerging all the time. The latest information can 
be found on the World-Wide Web. 
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Exercises 


(13.1) Entanglement (e) Discuss whether the three-qubit state V = 


000) + }111) possesses entanglement. 
(a) Show that the two-qubit state in eqn 13.8 is oe gem ee 8 i 


not entangled because it can be written as a (13.2) Quantum logic gates 


simple product of states of the individual par- This question goes through a particular example 
ticles in the basis |0’) = (|0) — |1))//2 and of the statement that any operation can be con- 
|1’) = (0) + |1))/v2. structed from a combination of a control gate and 
(b) Write the maximally-entangled state |00) + arbitrary rotations of the individual qubits. For 
\11) in the new basis. trapped ions the most straightforward logic gate 


to build is a controlled ‘rotation’ of Qubit 2 when 


yi = ? 
(c) Is }00) + |01) —|10) + {11) an entangled state? Ginny ve. 


[Hint. Try to write it in the form of eqn 13.6 
and find the coefficients. | Ucror {A|00) + B01) + C10) + D |11)} 


(d) Show that the two states given in eqns 13.9 = A|00) + B|01) + Cl10) — D{11 
and 13.10 are both entangled. 100) eng! a 


(13.3) 


(13.4) 


(a) Write Ucror as a 4 x 4 matrix and show that 
it is unitary. 

(b) Equation 13.3 defines the Hadamard transfor- 
mation. Write this transformation for Qubit 
2, Uy(2) as a 4x 4 matrix with the same basis 
states as in (a), and show that it is unitary. 


(c) Find the combination of Ucror, Ux(2) and 
Uj, (2) that gives the CNOT gate (Table 13.1). 


Elementary operations with more than two qubits 
The CNOT gate where the state of Qubit 1 con- 
trols whether Qubit 2 switches (Table 13.1) cor- 
responds to the operator Ucnor(1, 2). Note that 
Uonor (1,2) # Ucnor(2, 1) and that these gates 
do not affect other qubits. 


(a) A system of three qubits has a quantum bus 
that enables operations that swap the states 
of any pair of qubits: Uswar(1, 2) for 1 + 2, 
Uswap(2, 3) for 2 4 3 and Uswar(1, 3) for 
1 + 3. What combination of Uconor(1, 2) and 
SWAP operators gives Ucnor(1,3), a CNOT 
gate where Qubit 1 controls Qubit 3? 


(b) The operator Uy(i) gives the transforma- 
tion |0) 4 (|0) + |1))/W2 for the ith qubit. 
Show that Un(3)Un(2)Un(1) acting on |000) 
gives the three-qubit state with all coefficients 
eEqualmnaine va Au— ee —a(Ca— 1) — sii 
G = H in eqn 13.12. 

(c) The three operations in part (b) prepare eight 
input states, i.e. they put a three-qubit reg- 
ister into a superposition of the eight states. 
How many inputs are prepared by thirty such 
operations on thirty qubits. Is the initial state 
of the quantum register prepared in this way 
entangled? 


Grover’s search algorithm 

Imagine trying to find a given number in the tele- 
phone directory when you have written down the 
number but not the name. There is no clever clas- 
sical way of speeding up this tedious task but the 
massive parallelism of a quantum algorithm does 
make a big difference to this sort of problem. In 
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this question we consider finding a particular value 
of a two bit number x from the list of four possibil- 
ities, but the principles can be extended to larger 
numbers. To implement the search the quantum 
computer uses an operation in which the sign of 
the input changes for the specified value—if we 
assume the answer is x = |11) then the operation 
causes |11) +» —|11) but leaves the other states 
unchanged. Instead of a simple list, the assign- 
ment of xz could be made on the basis of a so- 
lution to an algebraic problem, e.g. the operator 
changes the sign of the input z if some function of 
x has a particular value such as f(z) = 0. In order 
that the computer can implement the sign change 
efficiently it is only necessary that the function 
f(x) can be evaluated efficiently for some general 
z. This is a much easier task than finding the value 
of x for which f(x) = 0. 

The figure shows the sequence of operations corre- 
sponding to Grover’s algorithm, starting with two 
qubits in |00). The operators are defined in the fol- 
lowing. The letters (a) to(f) indicate the part of 
this question corresponding to each stage. (Ignore 
normalisation factors throughout this question.) 


(a) Show that the Hadamard transformation 
(eqn 13.3) applied to each of the qubits of |00) 
results in the superposition state 


Win = |00) + |01) + [10) + |11) . 


This is the initial state of the two qubits 
needed for the algorithm. 


(b) The box labelled f(x) represents the oper- 
ation for the function, as described above. 
In this question this corresponds to a CROT 
gate (defined in Exercise 13.2). Write down 
Ucror Vin. 

(c) The next step in the algorithm is another 
Hadamard transformation of each qubit. We 
have already worked out the effect on |00) 
in part (a). Work out the effect on |01), 
|10) and |11). Show that the superposition 
is 100) + |01) + |10) — [11). 


(e) (f) 
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(13.5) 


(d) The operator Uz corresponds to |0) +> |0) and 
|1) ++ —|1), for each qubit. Write down the 
resulting state of the system of two qubits. 


Show that after another Ucror the state is 
|00) — |O1) — |10) + |11). 

Show that a final Hadamard transformation 
yields the required answer. 


Repeat the algorithm for another function 
with solution x = 10, to show that it picks 
out the required value, i.e. f(x) corresponds 
to |10) + —|10). 


Comment. This algorithm has a similar ‘one-way’ 
nature to the factorisation of numbers mentioned 
in the text—it is very time consuming to find the 
prime factors, but once possible solutions have 
been found by the quantum computation it is easy 
to verify whether they are indeed factors (or satisfy 
the equation) using a classical computer. Some 
more complex quantum algorithms do not always 
give the ‘correct’ answer but they are still useful 
in ‘winnowing the wheat from the chaff’; the al- 
gorithms pick out all the required answers (grains 
of wheat) along with a few unwanted numbers. 
This is not a problem since invalid numbers are 
thrown away after checking—the set of numbers 
to be checked is much smaller than the initial set 
of all possibilities, so the procedure is efficient. 


The quantum Zeno effect 

The ancient Greek philosopher Zeno proposed var- 
ious arguments against motion, e.g. Achilles and 
the tortoise in which the logic leads to the para- 
doxical conclusion that the man cannot overtake 
the tortoise. In the quantum Zeno effect the evo- 
lution of a wavefunction is slowed down by re- 
peated quantum measurements on the system, and 
this has been described by the colloquial phrase ‘a 
watched pot never boils’. 

This question is based on the usual treatment of 
Rabi oscillations rewritten in terms of the states of 
a qubit that starts in |0) at time ¢ = 0. Transitions 
between the states are induced by a perturbation 
which is characterised by the Rabi frequency 2: 


iene (F) een () 11). 


(13.6) 


(a) The state of the qubit is measured after a short 
time T < 1/Q. Show that the probability that 
the qubit ends up in |1) is (Qr/2)? < 1. What 
is the probability that the qubit is in |0) after 
the measurement? 

The state of the qubit is measured at time 7/2. 
What are now the probabilities of |0) and |1)? 


After the measurement at time 7/2 the qubit 
will be in either |0) or |1). The qubit then 
evolves for time 7/2 and its state is measured 
again at time 7. Calculate the probability of 
the outcome |1). 


(b) 


For a sequence of n measurements with a time 
interval of r/n between them, the probability 
of having made a transition from |0) to |1) at 
time 7 is 1/n times that in part (a). Verify 
this result for n = 3 or, if you can, justify it 
for the general case. 


(d) 


Discuss the application of these results to the 
measurement of the frequency of a narrow 
transition by the quantum jump technique, as 
described in Section 12.6. The transition rate 
on the weak transition decreases as the mea- 
surement periods of excitation on the strong 
transition become more frequent. What effect 
does this have on the measured line width of 
the narrow transition? 


Quantum error correction 

Classical computers encode each number with 
more than the minimum number of bits neces- 
sary; the additional ‘check’ bits allow the system 
to detect whether any of the bits has undergone a 
random change (caused by noise). In these error 
correction codes the binary code for each number 
differs from the string of Os and 1s for any other 
possible number by at least two bits, so a random 
change of a single bit leads to an invalid binary 
code that the computer rejects. In practice, error 
correction codes have the binary strings represent- 
ing valid entries ‘further apart’ to protect against 
errors in several bits. Quantum error correction 
(QEC) uses extra qubits to make the computa- 
tional states ‘further apart’ (in Hilbert space), so 
the system is more robust, i.e. it is harder for the 
computational states to become mixed. 


Qubits PSS SS ey 
Ir) 

| Possible 
\0) error 

I 
|0) : 
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(d) 


Key: | CNOT gate (Qubit 1 controls either Qubit 2 or Qubit 3) 


The above figure shows a particular QEC scheme. 
On the left-hand side, Qubit 1 starts in |x) and 
Qubits 2 and 3 both start in |0). The letters (a) to 
(d) indicate the part of this question correspond- 
ing to each stage. 


(a) 


To represent the qubit |x) = a|0) + b|1) this 
quantum error correction code uses Vin = 
a|000) + 6|111). Show that two CNOT gates 
acting on Qubits 2 and 3, with Qubit 1 as the 
control qubit in both cases, as indicated in the 
figure, encode |x) in this way. 

Suppose that a perturbation of Qubit 1 causes 
a bit-change error so that the state of the 
three qubits becomes a(a|000) + 6|111)) + 
B(al100) + 6|011)). Here 6 is the amplitude 
of the unwanted state mixed into the origi- 
nal state (and normalisation determines qa). 
Show that after another two CNOT gates we 
get (a|0) + b|1)) a|00) + (a}1) + 6/0)) B]11). 


(c) 


(d) 


The measurement of the states of Qubits 2 
and 3 has two possible results. One possibility 
is to find |11), which means that Qubit 1 has 
changed and in this case the error is corrected 
by applying a (conditional) NOT operation to 
Qubit 1, ie. |0) + |1) if and only if the mea- 
surement of the other qubits gives |11). What 
is the other possible state of Qubits 2 and 3 
resulting from the measurement? Verify that 
Qubit 1 ends up in the original state |x) after 
this stage. 


Qubits 2 and 3 are both reset to |0). Two fur- 
ther CNOT gates recreate W;, from |x), just 
as in stage (a). 


This scheme corrects ‘bit flip’ errors in any of the 
qubits. Show this by writing out what happens in 
stages (b), (c) and (d) for the state 


a (a |000) + b|111)) + 8 (a|100) + b{011)) 


+ y(a|010) + 6|101)) + 6 (a 001) + 6|110)). 
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1 This is normally the case when levels 
are bound states. We shall find that 
the eigenenergies depend on K?, which 
would generalise to |K|?. 


Appendix A: Perturbation 
theory 


Although degenerate perturbation theory is quite simple (cf. the series 
solution of differential equations used to find the eigenenergies of the 
hydrogen atom), it is often regarded as a ‘difficult’? and subtle topic 
in introductory quantum mechanics courses. Degenerate perturbation 
theory arises often in atomic physics, e.g. in the treatment of helium 
(eqn 3.14). Equations 6.34 and 7.89 in the treatment of the Zeeman 
effect on hyperfine structure and the a.c. Stark effect, respectively, also 
have a similar mathematical form, as can be appreciated by studying 
the general case in this appendix. Another aim of this appendix is 
to underline the point made in Chapter 3 that degenerate perturbation 
theory is not a mysterious quantum mechanical phenomenon (associated 
with exchange symmetry) and that a similar behaviour occurs when two 
classical systems interact with each other. 


A.1 Mathematics of perturbation theory 


The Hamiltonian for a system of two levels of energies FE; and E2 (where 
Ey > E,) with a perturbation given by H’, as in eqn 3.10, can be written 
in matrix form as 


Ho+H! = (7 ) +{ 11 e) (A.1) 
0 £, A>, Hy9} ° , 

The matrix elements of the perturbation are Hj, = (~,|H'|y2), and sim- 
ilarly for the others. The expectation values (y1|H’|¢1) and (q2|H"|w2) 
are the usual first-order perturbations. It is convenient to write the 
energies in terms of a mean energy J and an energy interval 2€ that 


takes into account the energy shift caused by the diagonal terms of the 
perturbation matrix: 


Fo + Hyg =JS +e. 


For simplicity, we assume that the off-diagonal terms are real, e.g. as for 
the case of exchange integrals in helium and the other examples in this 
book:* 


Hi, = Hy, = K. 
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When € = 0 this leads to a matrix eigenvalue equation similar to that 
used in the treatment of helium (eqn 3.14), but the formalism given here 
enables us to treat both degenerate and non-degenerate cases as different 
limits of the same equations. This two-level system is described by the 


matrix equation 
J+e K a a 
ree) "|: aa 


The eigenenergies E are found from the determinantal equation (as in 
eqn 7.91 or eqn 3.17): 


Bad eV kK. (A.3) 


This exact solution is valid for all values of K, not just small perturba- 
tions. However, it is instructive to look at the approximate values for 
weak and strong interactions. 


(a) Degenerate perturbation theory, K > 2c 
If K > 2e then the levels are effectively degenerate, i.e. their energy 
separation is small on the scale of the perturbation. For this strong 
perturbation the approximate eigenvalues are 


E=J+K, (A.4) 


as in helium (Section 3.2). The two eigenvalues have a splitting of 
2K and a mean energy of J. The eigenfunctions are admixtures of 
the original states with equal amplitudes. 


(b) Perturbation theory, K < 2 
When the perturbation is weak the approximate eigenvalues ob- 
tained by expanding eqn A.3 are 


p=Ja(e+%). (A.5) 


This is a second-order perturbation, proportional to K“, as in eqns 
6.36 and 7.92. 


A.2. Interaction of classical oscillators of — 
similar frequencies 


In this section we examine the behaviour of two classical oscillators of 
similar frequencies that interact with each other, e.g. the system of two 
masses joined by three springs shown in Fig. 3.3, or alternatively the 
system of three masses joined by two springs shown in Fig. A.1. The 
mathematics is very similar in both cases but we shall study the latter 
because it corresponds to a real system, namely a molecule of carbon 
dioxide (with M, representing oxygen atoms and Mp2 being a carbon 
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Fig. A.1 An illustration of degener- 
ate perturbation theory in classical me- 
chanics, closely related to that shown 
in Fig. 3.3. (a) The ‘unperturbed’ sys- 
tem corresponds to two harmonic oscil- 
lators, each of which has a mass M, 
attached to one end of a spring and 
with the other end fixed. These inde- 
pendent oscillators both have the same 
resonant frequency w;. (b) A system 
of three masses joined by two springs 
corresponds to two coupled harmonic 
oscillators. When Mz > M;, the cou- 
pling is weak. The displacements 21, 
a2 and x3 are measured from the rest 
position of each mass (and are taken as 
being positive to the right). One eigen- 
mode of the system corresponds to a 
symmetric stretch in which the central 
mass does not move; therefore this mo- 
tion has frequency w; (independent of 
the value of M2). (c) The asymmetric 
stretching mode has a frequency higher 
than w;. (This system gives a simple 
model of molecules such as carbon diox- 
ide; a low-frequency bending mode also 
arises in such molecules.) 


(b) 
GUNG) nT (an) 
aor eee rc l a 
1 t= 0 KE] 
(c) ' 
Ou) 
Ey 
atom). The equations of motion for this ‘ball-and-spring’ molecular 
model are 
M,2 = K (22 = £1) - (A.6) 
Mo2z2 = —K(x2 = £1) aP K(x3 a £2) ; (A.7) 
M23 = —k(2x3 = x2), (A.8) 


where « is the spring constant (of the bond between the carbon and 
oxygen atoms). The x-coordinates are the displacement of the masses 
from their equilibrium positions. Adding all three equations gives zero 
on the right-hand side since all the internal forces are equal and opposite 
and there is no acceleration of the centre of mass of the system. This 
constraint reduces the number of degrees of freedom to two. It is con- 
venient to use the variables u = z2 — 2, and v = 43 — £2. Substituting 
k/M, = w} and «/M2 = w3 leads to the matrix equation 


u\ _ (— (wi +3) we u 
| a ae) ee 
A suitable trial solution is 
Uu a iw 
(*) = (2) oom an 


This leads to an equation with the same form as eqn A.2 with € = 0 (cf. 
eqn 3.14). 


A.2 Interaction of classical oscillators of similar frequencies 


The determinantal equation yields 
(wi —w?) (wp + 2w} —w”) =0, (A.11) 


giving the two eigenfrequencies w = w; and w! = ./w? + 2w%. For fre- 
quency &; the eigenvector is b = a, corresponding to a symmetric stretch 
(Fig. A.1(b)); this is the same as the initial frequency since M2 does not 
move in this normal mode. The other normal mode of higher frequency 
corresponds to an asymmetric stretch with b = —a (see Fig. A.1(c)). In 
these motions the centre of mass does not accelerate, as can be ‘verified 
by summing eqns A.6, A.7 and A.8. 

A treatment of two coupled oscillators of different frequencies is given 
in Lyons (1998) in the chapter on normal modes; this is the classical 
analogue of the non-degenerate case that was included in the general 
treatment in the previous section. The books by Atkins (1983, 1994) 
give a comprehensive description of molecules. 
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'The notation indicates that the elec- 
trons are labelled 1 and 2, meaning that 
their locations are called r; and ro. 
At the same time, the electron charges 
are (probably) differently distributed in 
space (because the electrons’ wavefunc- 
tions have different quantum numbers), 
so pi and p2 are different functions of 
their arguments. 


We have called the integral J in antic- 
ipation of working out a direct integral, 
but an exchange integral is equally well 
catered for if we use appropriate formu- 
lae for pi(ri) and p2(r2). 


3-The integrals are written with the in- 
tegrand at the end to make clear the 
range of integration for each variable. 


Appendix B: The 
calculation of electrostatic 
energies 


This appendix describes a method of dealing with the integrals that 
arise in the calculation of the electrostatic interaction of two electrons, 
as in the helium atom and other two-electron systems. Consider two 
electrons! whose charge densities are p1(r;) and p2(r2). Their energy of 
electrostatic repulsion is (cf. eqn 3.15) 


2 
€ 
a [f exe) g——palea) or d°ry. 
MEQ 12 


In this expression we shall leave open the precise form of the two charge 
densities, so the theorem we are to set up will apply equally to direct 
and to exchange integrals.? Also, we shall permit the charge density 
pi(ri) = p(ri,61,¢1) to depend upon the angles 6; and ¢,, as well as 
upon the radius r;, and similarly for p2(r2); neither charge density is 
assumed to be spherically symmetric. 

In terms of the six spherical coordinates the integral takes the form 


(B.1) 


WT 20 T 20 
i / dé, sin af ao, | dA sin A dd 
0 0 0 0 


2 
e 


co co 
x if dri ry pi(ri, 41, 6) [ dr2 13 p2(r2, 2,2) _—— . 
0 0 ATED T12 
In rearranging this expression, we process just the two radial integrals. 
Within them, we divide the range for rj into a part from 0 to r; and a 
part from r, to infinity. The radial integrals become? 
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2 
e€ 
is dry re P1 wm) dr2 re P2 (r2) ———— 
ATE Ti2 
2 
+ drir r )f dro r es 
fe 1Ty p(t 2 2 Pa(ta) 7 
1 . 2 
drir d ie 
- i" TT PA wn) [ r27 AUS eee 
: Ts 5 2 
il dr2r r ih drir r, )———_ 
f 2 P2(F2) ace i pil Lt ieaere 


= i dr; re Pl (ri) Voi (ri) == if dro re p2(r2) Vi2(r2) : 
0 0 
(B.2) 


Figure B.1 illustrates the rearrangement of the ranges of integration. 
Here V2i(r1) is the partial electrostatic potential at r; caused by the 
charge density p(r2), partial because it is caused by only that part of 
p(r2) that lies at radii less than r,; and a similar definition applies to 
Vi2(r2). The entire electrostatic energy J is now obtained by integrating 
expression B.2 over the angles 6;, $1, 92 and ¢2. We obtain 
j= fo (r;) Voi (r1) d°r; ar [ rates) Vio(r2) d?r5 : (B.3) 
Expression B.3 can be applied to any charge densities, however com- 
plicated their dependence on the angles @ and ¢. But the significance of 
our result is most easily explained if we take the special case where the 
potentials V2;(r,) and Vi2(r2) are spherically symmetric—independent 
of angles—either in fact or as the result of imposing an approximation. 
In such a case we can think of V2,(r;) as the (partial) potential result- 
ing from a radial electric field* that is felt by electron 1 owing to the 
presence of electron 2; and likewise Vj2(r2) is the (partial) potential felt 


(a) (b) 

r2 T22=7) r2 P= ii 
2 = 
Ty ry 


4tn the trade, the potential itself is usu- 
ally referred to as a radial, or central 
field. We should point out that the 
functions Vi9 and V2 are here not nec- 
essarily the same functions of their ar- 
guments; indeed, they will generally be 
different unless pi(r) = p2(r). Later 
in this book we shall discuss the cen- 
tral field, which is a single V(r), acting 
alike on all electrons. The potentials 
introduced here are different from this 
common-to-all central field. 


Fig. B.1 Integration over the region 
r2 > 71 > 0 can be carried out in two 
ways: (a) integration with respect to 
ro from rg = 71 to oo, followed by in- 
tegration from ri = 0 to 00; or (b) in- 
tegration with respect to ri; from 0 to 
ri = re, followed by integration from 
rg = 0 to oo. The latter is convenient 
for calculating the electrostatic interac- 
tion in eqn B.2 and it leads to two con- 
tributions that are related by an inter- 
change of the particle labels r1 © ro. 
For a symmetrical configuration such 
as ls? in helium these two contribu- 
tions are equal and this feature reduces 
the amount of calculation. By defini- 
tion, exchange integrals are unchanged 
by swapping the labels r; <+ r2 and so 
the same comments apply to their eval- 
uation. 
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5A similar simplification can be used in 
the calculation of exchange integrals, as 
in Exercise 3.6. 


®The physical idea that each electron 
should ‘feel’ a potential due to the other 
is So intuitive that there can be a strong 
temptation to write the interaction en- 
ergy down twice—once in the form B.4 
and a second time with 1 and 2 inter- 
changed. If this is done, the interaction 
energy is double-counted. The advan- 
tage of the form B.3 is precisely that it 
conforms to the intuitive idea without 
double-counting. 


by electron 2 owing to the presence of electron 1. When p(ri) = p(r2), 
as in the calculation of the direct integral for the 1s? configuration (Sec- 
tion 3.3.1), these two parts of eqn B.3 are equal (and only one integral 
needs to be calculated).® 

The fact that the potentials Vi2 and V2; are partial potentials deserves 
comment. Had we defined a V2;(r;) to represent the whole of the elec- 
trostatic potential from p2(r2) acting on electron 1, then the interaction 
energy J would have been given by the single term 


7 20 co 
i, dé; sin 6; / dd; / dr, re P1 (ri) Voi (r1) 
0 0 0 


(B.4) 
= f er) Vales) an, 

or by a similar expression with the labels 1 and 2 interchanged through- 
out.® Such a presentation is, of course, physically correct, but it is much 
less convenient mathematically than the version presented earlier; it 
precludes the separation of the Schrodinger equation into two equations, 
each describing the behaviour of one electron. 

The simplified form for J given in eqn B.3 is used in the evaluation of 
direct and exchange integrals for helium in Chapter 3, and is generally 
applicable to electrostatic energies. 


Appendix C: Magnetic 
dipole transitions 


The electric dipole transitions that lead to the emission and absorption 
of light by atoms are discussed in many places in this book, and the 
selection rules for this type of transition are summarised in Table 5.1. 
These rules are not obeyed in radio-frequency transitions between Zee- 
man sub-levels where only the magnetic quantum number changes, or 
in transitions between different hyperfine levels where F and Mr can 
change—these transitions in radio-frequency spectroscopy are magnetic 
dipole or M1 transitions induced by the oscillating magnetic field of the 
radiation! 

By = Bo coswt . (Cay 


The transition matrix element between hyperfine levels is 
boi x (2| 4 Bre |1), (C.2) 


where yt is the magnetic dipole operator in eqn 5.9. This gives transitions 
for which Al = AL = AS = 0 and the only interesting selection rules 
are those for F and Mr: 


AF =0,+1 (but not0—-0), 
AMy =0,+1 (note that AF = 0 = AMF gives no transition) . 


These are as expected from angular momentum conservation and a 
dipole operator that can change angular momentum by one unit (as 
discussed for electric dipole transitions in Section 2.3.5). 

The spontaneous decay rate for radio-frequency transitions is propor- 


tional to ; 
Aa ow lwoil , (G:3) 


where w is the angular frequency, which is small relative to optical tran- 
sitions. The matrix element 12 is also much smaller than that for El 
transitions: 


BID? 2 

|(3| er - E|1)| e€ag/Z 
where the factor c arises because the ratio of the magnetic to the elec- 
tric field, |B| /|E| = 1/c in an electromagnetic wave; the atomic size 


scales as 1/Z, see Section 1.9. Therefore spontaneous emission is gener- 
ally negligible in radio-frequency and microwave spectroscopy. In outer 
space, however, the gas density is low and there are vast clouds of atomic 


tan important exception is the Lamb 
shift transition, where two levels of op- 
posite parity have a very small energy 
separation (see Section 2.3.4). This is 
an El transition. 
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hydrogen; even weak emission at a wavelength of 21cm gives enough mi- 
crowave radiation to be picked up by radio telescopes. 

The selection rules for magnetic dipole radiation transitions are given 
in the table below, together with the electric dipole transitions for com- 


parison. 
Electric dipole transitions Magnetic dipole transitions 
AJ =0,+1' AJ =0,+1' 
(but not. J =0 tot’ = 0) (but not = 0 to 7 =O) 
AM; =0,+1' AM, =0,+1 
Parity change No parity change 
7S eho! Ar 0 No change of 
Any An ano \ configuration 
AG =, sell Sh = 0 
AS = (0) ixor ==) 


tIn hyperfine structure My is not a good quantum number and the rules 
for F and Mp apply instead. This is also true for #1 transitions, and is an 
extension of the selection rules in the Zeeman effect. 


In certain circumstances magnetic dipole transitions can give rise to 
visible transitions in atoms, and there are also electric quadrupole (E2) 
transitions arising from the breakdown of the dipole approximation. 
These forbidden transitions, i.e. forbidden for electric dipole transitions, 
are detailed in Corney (2000). 


Appendix D: The line 
shape in saturated 
absorption spectroscopy 


The description of saturated absorption spectroscopy in Section 8.3 ex- 
plained qualitatively how this technique gives a Doppler-free signal. 
This appendix gives a more quantitative treatment based on modify- 
ing eqn 8.11 to account for the change in populations produced by the 
light. We shall use N; (v) to denote the number density of atoms in 
level 1 with velocities v to v + dv (along the direction of light) and 
Nz (v) to denote those in level 2 within the same velocity class. At low 
intensities most atoms remain in the ground state, so N; (v) ~ Nf (v) 
and N2(v) ~ 0. Higher intensity radiation excites atoms close to the 
resonant velocity (given in eqn 8.12) into the upper level, as shown in 
Fig. 8.4. Within each narrow range of velocities v to v + du the ra- 
diation affects the atoms in the same way, so we can use eqn 7.82 for 
homogeneous broadening to write the difference in population densities 
(for atoms in a given velocity class) as 


1 
14+ (1/Tgat)L (w — wo + kv)” 


N, (v) — No(v) = Nf (v) x (D.1) 
This includes the Doppler shift +kv for a laser beam propagating in the 
opposite direction to atoms with positive velocities, e.g. the pump beam 
in Fig. 8.4. The Lorentzian function LD (w — wo + kv) is defined so that 
L (0) = 1, namely 


b/4 
ae D.2 
@ = 3 aa (D.2) 
For low intensities J] < Iza, we can make the approximation 
Uf 
N; (v) —Na(o) = NF(o) {I~ 3 L (w — aig + bo) b (D.3) 
sat 


The expression inside the curly brackets equals unity except near v = 
— (Ww — wo) /k and gives a mathematical representation of the ‘hole burnt’ 
in the Maxwellian velocity distribution Nf (v) by the pump beam (as 
illustrated in Figs 8.3 and 8.4). In this low-intensity approximation the 
hole has a width of Av = ['/k. Atoms in each velocity class absorb light 
with a cross-section given by eqn 7.76, with a frequency detuning that 
takes into account the Doppler shift. The absorption of a weak probe 
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beam travelling along the z-direction through a gas with a strong pump 
beam in the opposite direction is given by the integral in eqn 8.17: 


Dy fm (v) — No (v)} a (w —-kv) du 


= / N f(v) : 00 y?/4 
=| > _j} __sigiteao 15. See ce 
ase Tsat (W—wo+kv)? +1? /4 (w — Wo — kv) + /4 


(D.4) | 


Note the opposite sign of the Doppler shift for the probe (—kv) and pump 
(+kv) beams. (Both beams have angular frequency w in the laboratory 
frame.) For low intensities, I/Isa, < 1, the same approximation as in 
going from eqn D.1 to D.3 gives 


(a1) = Now ff (v) L(w~ uo — kv) 41-7 


sat 


1 (w~ up + hu) bd, 


(D.5) 
As I — 0, this reduces to eqn 8.11 for Doppler broadening without satu- 
ration, i.e. the convolution of f (v) and L (w — wo — kv). The intensity- 
dependent part contains the integral 


is f (v) L(w—wo — kv) L (w — wo + kv) du 


- eh. ee ee 
a of a +02/4 * 19 (w— wp) =a} +12 ee 
(D.6) 


The product of the two Lorentzian functions is small, except where both 
W— wo + kv = 0 and w — wo — ku = 0. Solving these two equations we 
find that the integrand only has a significant value when kv = 0 and 
Ww —Wo = 0. The Gaussian function does not vary significantly from 
f (v = 0) over this region so it has been taken outside the integral. The 
change of variables to z = w — wo + kv shows clearly that the integral 
is the convolution of two Lorentzian functions (that represent the hole 
burnt in the population density and the line shape for absorption of the 
probe beam). The convolution of two Lorentzian functions of widths I 
and I’ gives another of width +I’ (Exercise 8.8). The convolution of 
two Lorentzian functions with the same width [ = I’ in eqn D.6 gives a 
Lorentzian function of width + I’ = 21 with the variable 2 (w — wo); 
this is proportional to gy(w), as defined in eqn 7.77 (see Exercise 8.8). 
Thus a pump beam of intensity J causes the probe beam to have an 
absorption coefficient of 


ee 
kK (w) =N x35 zat 


Angp(s) {1~z—"au(w)}. — (D.7 


The function in the curly brackets represents the reduced absorption at 
the centre of the Doppler-broadened line—it gives the peak in the probe 
beam intensity transmitted through the gas when w = wo, as shown in 
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Fig. 8.4(b). This saturated-absorption signal comes from the velocity 
class of atoms with v = 0. 

We assumed that J < I,,4 to obtain eqn D.3, but usually in exper- 
iments the pump beam has an intensity close to saturation to give a 
strong signal. The simple rate equation treatment becomes inaccurate 
around J,,, and a more sophisticated approach is required based on the 
optical Bloch equations, as described in Letokhov and Chebotaev (1977). 
Also, optical pumping out of the state that interacts with the radiation 
into other Zeeman sub-levels, or hyperfine levels, can be the dominant 
way of depleting the lower-level population N; (v). However, the line 
shape remains symmetrical about wo, so saturation spectroscopy gives 
an accurate measurement of the atomic resonance frequency. 
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E.2 Two-photon transitions 313 


1More rigorous treatments can be 
found in quantum mechanics texts. 


2We assume that the Rabi frequency is 
real, ie. OF, = Oi. 


Appendix E: Raman and 
two-photon transitions 


E.1 Raman transitions 


This appendix gives an explanation of Raman (and two-photon transi- 
tions) by adapting the treatment of single-photon transitions given in 
Chapter 7—this approach gives much more physical insight than simply 
quoting the theoretical formulae from second-order time-dependent per- 
turbation theory. A Raman transition involves two laser beams with 
frequencies wy; and wy2, and the atom interacts with an electric field 
that has two frequency components: 


E = Ey cos (wy it) + Ep2 cos (wet) . (E.1) 


A Raman transition between two atomic levels, labelled 1 and 2, in- 
volves a third atomic level, as shown in Fig. 9.20. This third level is 
labelled i for intermediate, but it is very important to appreciate that 
atoms are not really excited to level 2. The treatment presented here 
emphasises that Raman transitions are fundamentally different from a 
process comprised of two single-photon transitions (1 + 7 followed by 
i + 2). As in Section 9.8, we take the frequencies of the levels to be 
related by w; > we > wu}. 

Firstly, we consider the perturbation produced by the light at wy for 
the transition between levels 1 and 7. This is the same situation as for 
a two-level atom interacting with an oscillating electric field that was 
described in Chapter 7, but here the upper level is labelled 7 instead of 
having the label 2 (and we write wy; rather than w). For a weak pertur- 
bation, the lower state |1) has amplitude c, (0) = 1 and the amplitude 
of |7), from eqn 7.14, is 


Ae Qa [1 — exp {i(ai — wi — wr) t} 
: 2 Wj — Wy — Wp} ; 
Here the Rabi frequency for the transition 0; is defined in terms of Ey; 


as in eqn 7.12. We define the difference between the laser frequency wy 
and the frequency of the transition between levels 1 and 7 as 


(E.2) 


Avs CO ar Wein = CK « (E.3) 
From eqn 7.76 the wavefunction of the atom is 


oe) em ol hs 


, a 
BOE as em NS 1 
ee 5A ° ee (E.4) 


The perturbation produces the admixture of two terms into the initial 
state |1) that both have the same small amplitude 0;;/|2A| < 1. We 
will find that the term with angular frequency w; represents real exci- 
tation to |i). The term with angular frequency w; + wy; corresponds 
to a virtual level, i.e. in the mathematics this term acts as if there is a 
level at an energy hwy, above the ground state that has the symmetry 
properties of |i), but in reality there is no such level. 

To determine the effect of the field oscillating at wy. on the perturbed 
atom, we take eqn 7.10, which states that for a single-photon transition 
ico = cos (wt) e”>%e,, and make the replacements w > wy2, wo > 
We — w; and 2 + Ny; to obtain® 


iCo (t) = Na cos (wpat ) el(¥2-“*)#e; (#) . (E.5) 
Insertion of the expression for c; (t) from eqn E.2 yields 
ico (t) a =O). cos (wrt) ei(wi—wa)t x a E = el(wi-wi at 
a NM iwpat —iwzat —i(w; -we)t i{(w2—w1)—wy1 }t 
a =e [e are | G le =€ : 
(E.6) 


Integration and the rotating-wave approximation lead to 


, @) _ MG ities e—i(wi—we—wr2)t 1 — ei{(w2—w1)—(wi1 -wi2) }t 
. 7 4A (OF = (bh) = C5) (we aa Ww) a (wri cae WL2) 
_ 2,041 —i(A+6)t Nit —idt 
= TA(A +6) [1 e AAS [1 e ie (E.7) 
where 
6 = (a1 — wre) — (we — w) (E.8) 


is the frequency detuning from the Raman resonance condition that the 
difference in the laser frequencies matches the energy difference between 
levels 1 and 2, over fi (see Fig. 9.20).4 Equation E.7 looks complicated 
but its two parts have a straightforward physical interpretation—we can 
find the conditions for which each part is important by examining their 
denominators. Raman transitions are important when 6 ~ 0 and A is 
large (|6| < |A]) so that the second part of eqn E.7 dominates (and the 
individual single-photon transitions are far from resonance). Defining 
an effective Rabi frequency as 


Q;,243 (2| ere E;2 i) (2| eke Ey |1) 
= SO E.9 
est Dae h2 (a SOU) = WL1) ; ( ) 
we can write eqn E.7 as 
Oop Ve A Neg 1 — ei! 
= $$ - — —_—_ E.10 
0 = a 2 5 ey) 
The first term can be neglected when |6| < |A| to yield 
1 ot 
|c2 Gil = Ken if sinc? () : (E.11) 
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3 The possibility that radiation at an- 
gular frequency wy; could drive this 
transition between 2 and 2 is considered 
later for two-photon transitions. 


4 Notice that this condition does not de- 
pend on w;. The Raman transition can 
be viewed as a coupling between |1) and 
|2) via a virtual level, whose origin can 
be traced back to the term (with a small 
amplitude) at frequency w, + wy in 
eqn E.4. However, although a virtual 
level gives a useful physical picture it 
is entirely fictitious—during a Raman 
transition there is negligible population 
in the excited state and hence negligible 
spontaneous emission. 
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°Raman transitions can impart mo- 
mentum to the atoms and this makes 
them extremely useful for manipulating 
atoms and ions (see Section 9.8). 


6Similarly, the quantity A + 6 = w; — 
w2 — Wp2 (in eqn E.7) is small when 
wi2 matches the frequency of the tran- 
sition between |z) and |2). For this con- 
dition the single-photon transition be- 
tween levels 2 and 2 is the dominant 
process. The single-photon processes 
can be traced back to the small ampli- 
tude with time dependence exp(iw;t) in 
eqn E.4. 


"The generalisation to the case where 
Q41 4 Qo; is straightforward. 


8 The hyperfine splitting is wo —w i = 
2x x 1.7GHz but we do not need to 
know this for this calculation. 


This is the same as eqn 7.15 for one-photon transitions, but Qe has re- 
placed 9. This presentation of Raman transitions has assumed a weak 
perturbation and we have found results analogous to those for the weak 
excitation of a single-photon transition (Section 7.1); a more comprehen- 
sive treatment of the Raman coupling between |1) and |2) with effective 
Rabi frequency Qe, analogous to that in Section 7.3, shows that Ra- 
man transitions give rise to Rabi oscillations, e.g. a a-pulse transfers 
all the population from 1 to 2, or the reverse. Raman transitions are 
coherent in the same way as radio-frequency, or microwave, transitions 
directly between the two-levels, e.g. a Raman pulse can put the atomic 
wavefunction into a coherent superposition state A|1) + B|2).° 

It is vital to realise that the Raman transition has a quite distinct na- 
ture from a transition in two steps, i.e. a single-photon transition from 
level 1 to i and then a second step from i to 2. The two-step process 
would be described by rate equations and have spontaneous emission 
from the real intermediate state. This process is more important than 
the coherent Raman process when the frequency detuning A is small 
so that wy, matches the frequency of the transition between |1) and 
|:).6 The distinction between a coherent Raman process (involving si- 
multaneous absorption and stimulated emission) and two single-photon 
transitions can be seen in the following example. 


Example E.1 The duration of a z-pulse (that contains both frequen- 
cies wp; and wy2) is given by 

ett, =. (E.12) 
For simplicity, we shall assume that both Raman beams have similar in- 
tensities so that Qi, ~ Q2; ~ N and hence Neg ~ N?/A (neglecting small 
factors).’ From eqn 9.3 we find that the rate of scattering of photons 
on the transition |1) to |é) is approximately [Q?/A?, since for a Raman 
transition A is large (A >> [). Thus the number of spontaneously- 
emitted photons during the Raman pulse is 


(TQ? cA 7 761 


Regcatttx A Q2 es) a , 


(E.13) 
This shows that spontaneous emission is negligible when A > IT. As 
a specific example, consider the Raman transition between the two hy- 
perfine levels in the 3s ground configuration of sodium® driven by two 
laser beams with frequencies such that A = 27 x 3GHz. The 3p ?P, /2 
level acts as the intermediate level and has [ = 27 x 10’s-!. Thus 
Rscattts ~ 0.01—this means that the atoms can be subjected to many 
m-pulses before there is a spontaneous emission event that destroys the 
coherence (e.g. in an interferometer as described in Section 11.7). Ad- 
mittedly, this calculation is crude but it does indicate the relative im- 
portance of the coherent Raman transitions and excitation of the in- 
termediate level by single-photon processes. (It exemplifies the order- 
of-magnitude estimate that should precede calculations.) The same ap- 


proximations give the duration of a z-pulse as 


ane a EE ~ 284 Isat 
Gea G2 = 1 a. 


(E.14) 


where eqn 7.86 has been used to relate 9? to intensity. Thus in a Raman 
experiment carried out with two laser beams, each of intensity 3/za;, and 
the above values of A and T for sodium, the pulse has a duration of 
= 10 ps. 
Tv 


E.2 Two-photon transitions 


The intuitive description of Raman transitions as two successive appli- 
cations of the first-order time-dependent perturbation theory result for 
a single-photon transition can also be applied to two-photon transitions 
between levels 1 and 2 via i, where w2 > w; > w,. The two-photon rate 
between levels 1 and 2 is 


>{ (2 |er - Ey2|2) (é ler - Ey;| 1) 


= 
A? (wi — wi — wr) 


i 


x g (wri + w2). 


(E.15) 
This has the form of the modulus squared of a sum of amplitudes mul- 
tiplied by the line shape function g(wy1 +wp2). There are two con- 
tributing amplitudes from (a) the process where the atom interacts with 
the beam whose electric field is Ey; and then with the beam whose 
field is Ey2, and (b) the process where the atom absorbs photons from 
the two laser beams in the opposite order.? The energy increases by 
R(wp1 + wp2) independent of the order in which the atom absorbs the 
photons, and the amplitude in the excited state is the sum of the am- 
plitudes for these two possibilities. In Doppler-free two-photon spectro- 
scopy (Section 8.4) the two counter-propagating laser beams have the 
same frequency wy; = WL2 = w, and we shall also assume that they 
have the same magnitude of electric field (as would be the case for the 
apparatus shown in Fig. 8.8). This leads to an excitation rate given by 


9 
2 Y ee 2:2 a1 
Wi —- (oh — Ww 


with 1;; and Qe; as defined in the previous section. The transition has 
a homogeneous width I’ greater than, or equal to, the natural width 
of the upper level T > IT; this Lorentzian line shape function has a 
similar form to that in eqn 7.77, with a maximum at the two-photon 
resonance frequency Wj) = W2 — Ww, (as in Section 8.4). The constraint 
that the two photons have the same frequency means that the frequency 
detuning from the intermediate level A = w; — (wi + w) is generally 
much larger than in Raman transitions, e.g. for the 1s—2s transition in 


, (2ler-Exilé) (ier - Ev2/ 1) a 
h? (w; — wy — wr2) 


T'/ (2m) 


hip = a a 
- (wre = Joy aF [2/4 


(E.16) 
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° Only one of these paths is near res- 
onance for Raman transitions because 
WL1 — WL2 #WL2 — Wh. 
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AUN rough estimate of the two-photon 
rate can be made in a similar way to the 
calculations for Raman transitions in 
the previous section (Demtréder 1996). 


hydrogen the nearest level than can provide an intermediate state is 
2p, which is almost degenerate with the 2s level (see Fig. 8.4); thus 
Ww; & wo and A ~ w = 2a x 1045s! (or 4 x 10° times larger than 
the frequency detuning used in the example of a Raman transition in 
the previous section). There are two important consequences of this 
large frequency detuning: (a) in real atoms there are many levels with 
comparable frequency detuning and taking these other paths (1 > i > 
2) into account leads to the summation over i in eqn E.16; and (b) 
the rate of two-photon transitions is small even for high intensities (cf. 
allowed single-photon transitions).° 


Appendix F: The statistical 
mechanics of 
Bose-Einstein 
condensation 


This appendix does not attempt to reproduce the standard treatment 
of a Bose-Einstein condensate found in statistical mechanics texts, but 
aims to give a complementary viewpoint, that emphasises the link be- 
tween photons and atoms, and also to describe BEC in a harmonic 
potential. 


F.1 The statistical mechanics of photons 


The Planck formula for the energy density of radiation per unit band- 
width p(w), used in the Einstein treatment of radiation (see eqn 1.29), 
can be written as a product of three factors: 


p(w) dw = hw x fon (W) X Don (w) dw. (F.1) 


Here fw is the photon energy, the function fpn(w) = 1/ (e%" — 1), 
with 6 = 1/kpT, determines the number of photons per energy level 
and Dpn (w) is the density of states per unit bandwidth.’ Although the 
distribution fpn becomes very large as w — 0 (infra-red divergence), 
the integration over the frequency distribution (using the substitution 
x = Bhw) yields a finite result for the total energy of the radiation in 
the volume V: = 

pa | p(w)dwa« VT". (F.2) 

0 

This result follows from dimensional considerations, without the evalu- 
ation of the definite integral.? This integral for E is a particular case 
of the general expression in statistical mechanics for the energy of the 
system which is obtained by summing the energy over all occupied levels: 


a= Das (€%) €. (F.3) 


Here f (€;) gives the distribution over the levels of energy ¢;. The integral 
in eqn F.2, for the particular case of photons, gives a close approximation 
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1'The number of states in phase-space 
with wavevectors between k and k+dk 
equals the volume of a spherical shell 
of thickness dk times the density of the 
states in k-space, 4mk? dk x V/(2m)?. 
For photons we need an extra factor 
of 2, because of the different possible 
polarizations, and the substitution k = 
w/c. 


2The energy density E/V has the 
same T* dependence as the Stefan— 
Boltzmann law for the power per unit 
area radiated by a black body (as 
expected, since cH#/V corresponds to 
power divided by area). 
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3 This may appear to be a circular argu- 
ment since the large occupation of the 
lowest level is the signature of the Bose— 
Einstein condensation that we want to 
investigate! This section shows that 
this is a consistent solution of the equa- 
tions for bosons, and the validity of 
this treatment can be appreciated more 
readily after deriving the equations. 


4We assume a thermal energy kp 
much greater than the spacing be- 
tween the energy levels. Otherwise, 
particles sit in the ground state sim- 
ply because of the Boltzmann factor 
exp {—6 (€1 — €0)} < 1, where €; is the 
energy of the first excited level (Sec- 
tion 12.9). 


®Otherwise there would be an energy 
for which e—p = 0; this would make the 
denominator in eqn F.5 equal to zero so 
that f (€) — oo. 


to this summation when the system occupies many levels to give an 
almost continuous distribution. The density of states in the integral 
represents a Summation over the number of levels within each frequency 
(or energy) interval dw (= de/h). 


F.2 Bose—Einstein condensation 


The crucial difference between photons and particles in statistical me- 
chanics is that photons simply disappear as the temperature tends to 
zero, as we can see from eqn F.2. In contrast, for a system of particles, 
such as a gas of atoms or molecules in a box, the number remains con- 
stant. This introduces a second constraint on the distribution function, 
namely that the sum of the populations in all the energy levels equals 
the total number V. Note that the symbol ‘NV’ is used here because ‘N’ 
was used in Chapter 7 to represent the number density (the convention 
usually adopted in laser physics). Here, number density is denoted by 
‘n’, as is usual in statistical mechanics. Hence 


N= df (ei). (F.4) 


This equation for conservation of number, and eqn F.3 for the total 
energy, apply to any system of particles. We shall consider particles 
with integer spin that follow the Bose-Einstein distribution function 


far (€) = : 


eben) — 1° 

This function has two parameters 6 and pu that can be determined by the 
two constraints. For the particular case of bosons at low temperatures, 
the chemical potential y has little consequence except for atoms in the 
lowest energy state, so that for the higher-lying levels fpp (€) closely 
resembles the distribution of photons! We can justify this assertion by 
considering the properties of a system with a significant population No in 
the ground state. The number of atoms in the lowest level with energy 
€g is given by 


(F.5) 


1 
acs ie, (F.6) 
Hence ; j 
Eo —P 
=] i, <b —— |) 2 —. A 
kat. n( x) No eG 


Einstein originally considered gases with a large total number of atoms, 
N ~ 1078, so, even when No is only a small fraction of NV, the difference 
Eo — pis negligible in comparison to the thermal energy kgT. (This ther- 
modynamic, or large number, approximation works even for samples of 
10° magnetically-trapped atoms.*) The equation shows that the chem- 
ical potential is lower than €o, the lowest energy level in the system.® 
For the first excited level we find that 
kpT 


€1 — w= (€1 — €0) + (€0 — BW) © hw + —— ~ fw. 
No 
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Hence, except for the ground-state population, can be neglected and 
fpr becomes the same as the distribution for photons: 


dl 


From what has been said above you may wonder how neglecting the 
chemical potential can be consistent with conservation of particle num- 
ber in eqn F.4. This equation can be expressed as an integral of f (ce) 
times the density of states for particles, D (e), 


N=No+ [Fle Die)de. (F.9) 
0 


The number in the ground state, No, has to be put in explicitly because 
the integral does not properly count these atoms. Effectively, we have 
replaced pz as a parameter by No (these are related by eqn F.7). The two 
terms in eqn F.9 give the number of particles in the two parts, or sub- 
systems, that make up the whole. From this perspective we regard the 
N — No particles in the excited states (€ > €9) as a sub-system that ex- 
changes particles with the condensate (atoms in the ground state). Thus 
atoms in the excited states behave as if there is no number conservation: 
N — No - 0 when T > 0, as for photons. 

The integral in eqn F.9 contains the distribution function from eqn F.8 
times the density of states for particles given by 


D(e) = AVel/? de, (F.10) 
where A is a constant.® With the substitution z = Be, eqn F.9 becomes 
No =N — AV (pT) C, (F.11) 


where ¢ represents the value of the integral given in statistical mechanics 


texts as ae a 
hy 1 
= dx = 2.6 x —_. Ele 

¢ ik aa er a 


The ground-state occupation goes to zero, No = 0, at the critical tem- 
perature Tc given by 


= = A(kpTc)* ¢. (F.13) 
With A = 27(2M)/2/h? and eqn F.12 for ¢, this gives eqn 10.14. The 
discussion here supposes that there is a large population in the lowest 
level (the Bose-Einstein condensate) and determines the temperature at 
which No goes to zero. (A different perspective adopted in many treat- 
ments is to consider what happens as atoms are cooled down towards 
Tc.) Dividing eqn F.11 by F.13 gives the fraction of particles in the 
ground state for a Bose gas in a box as 


La ey (F.14) 


6D(w) differs fundamentally from 
Dp» (w) in eqn F.1 because a particle’s 
energy is proportional to the square of 
its wavevector, € « k, ie. € = p?/2M 
with momentum p = hk. 
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ON large condensate has a chemical po- 
tential that is considerably greater than 
the energy of the harmonic oscillator 
ground state; however, this turns out 
not to seriously affect results such as 
eqn F.16. 


Note that the strength of the interaction between the atoms does not 
appear in this treatment—the value of Ti does not depend on the scat- 
tering length. This shows that BEC arises from quantum statistics. In 
real experiments there must be interactions so that atoms have a finite 
collision cross-section, otherwise there would not be any mechanism for 
establishing thermal equilibrium and evaporative cooling would not be 
possible. (A non-interacting Bose gas has some curious properties.) 


F.2.1 Bose—Einstein condensation in a harmonic 
trap 


The volume of the trapped atomic cloud depends on temperature as 
V « T°/? (from eqn 10.16); hence we find that for a trapped atom the 
equation equivalent to eqn F.11 is 


N-No «xf. (F.15) 


This dependence on the cube of T arises because the density of states for 
particles in a harmonic trap is different to that given in eqn F.10 for a 
gas in a box of fixed volume (i.e. an infinite square-well potential). This 
affects the way that the states fill up and hence the conditions for BEC. 
An argument analogous to that leading to eqn F.14 gives the fraction in 


the ground state as : 
T 
40) = (=) , (F.16) 


This is a stronger dependence on T'/Tc than in a homogeneous gas. 
At T = 0.99 T¢ this equation predicts a condensate fraction of No/N = 
0.03, so that even just below Tc a cloud of NV ~ 10° trapped atoms gives 
1/No < 1, and this partly justifies the assumptions made after eqn F.7. 
Typically, experiments are carried out at around T/T ~ 0.5, or below, 
where only a fraction (0.5)° = 0.125 of the atoms remain in the thermal 
cloud. This gives a sufficiently pure condensate for most purposes and 
further evaporative cooling would cut deeply into the condensate and 
reduce No.’ 
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principle 218-20 
magneto-optical trap (MOT) 190 4 
use with magnetic trap 223 
magnetrons 273 
masers, hydrogen 100-1, 119 
mass, effect in isotope shift 105-6 
reduced 5, 
mass spectroscopy of ions 274 
Mathieu equation 264 
matter-wave experiments 246-7, 257 
diffraction gratings 249-51 
double-slit experiment 249 
measurement of rotation 251-3 
three-grating interferometer 251-2, 
257 
matter waves 242 
de Broglie relation 246 
diffraction by light 253-5 
Raman interferometry 255-7 
mean oscillation frequency 236 
mercury 
anomalous Zeeman effect 92 
energy levels 87-8 
transitions 90 
mercury ion clock 266 
metastable states 53 
micromotion 265 
microscopy, optical tweezers 196-7 
mirror symmetry (parity) 33 
molecular potentials 229, 231 
monochromatic radiation, interaction 
with 127-32, 138 
Moseley, and atomic number 7-11 
motion-induced orientation 207 
moving molasses technique 212 
muonic atom 122 


nano-fabrication of gratings 246, 249 

neutrons, de Broglie wavelengths 246 

normal mass shift 106 

normal Zeeman effect, see Zeeman 
effect 


nuclear magnetic moments 97 

nuclear magnetic resonance (NMR), 
quantum computing 
experiments 287-9, 291 

nuclear magneton 97 

nuclear radius 106-8 

nuclear spin 97 


optical absorption by moving atoms 
155-6 
optical absorption cross-section 
138 41, 147 
for pure radiative broadening 141-2 
power broadening 143 
saturation intensity 142-3 
optical Bloch equations 137, 146 
optical frequency combs 171-4 
optical lattice 201-2 
optical molasses technique 178, 185-7 
Doppler cooling limit 188 90 
in magneto-optical trap 190—4 
use with dipole trapping 200-1 
optical pumping 140, 204-5, 207 
optical spectroscopy 
effect of Doppler broadening 153 
measurement of hyperfine structure 
113-14, 161 
measurement of Zeeman splitting 
N78) 
optical tweezers 196 7 
orbital angular momentum quantum 
number 24 
oscillating electric field, perturbation 
by 124-5 
oscillator strength 149-50 
oscillators, interaction of 299-301 


a-pulse duration 312-13 
m-pulses and 2/2-pulses 128 
ax-transitions 31 
p-series 34 
p-wave scattering 231 
parallelism in quantum computing 
289-90 
parity 32-4 
partial electrostatic potential 303-4 
Paschen—Back effect 91, 93-4 
Paul trap 261-2, 271-2 
use In quantum computing 282 
Pauli exclusion principle 46, 60 
pendulum, inverted 264 
Penning trap 271-3 
magnetic moment of electron 274-5 
mass spectroscopy of ions 274 
periodic table 60-1 
perturbation by oscillating electric 
field 124-5 
perturbation theory 
interaction of classical oscillators of 
similar frequencies 299 301 
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mathematics of 298-9 
photo-ionization 145 
photoelectric effect 145-6 
photons 

anti-bunching 146 

statistical mechanics 315-16 
pinch coils 222-4 
Planck distribution law 13 
polarizability 197 
polarization 15-16, 29-31, 91, 127, 

140-1 
gradient 204-7 
positron, magnetic moment 275 
power broadening 143 
precession 37 
pressure broadening, see collision 
broadening 

principal quantum number 5 

probability density 9, 62, 99 

probe beam, saturated absorption 
spectroscopy 157-9 

projection theorem, see Wigner—Eckart 
theorem 

pulsed lasers 172 

pump beam, saturated absorption 
spectroscopy 157-9 


quadrupole interaction 102 
quadrupole magnetic fields 220-1 
in linear Paul trap 263 
quantisation 145-6 
quantum computing 282, 291, 293-4 
decoherence 292 
logic gates 287 9 
parallelism 289 90 
qubits 283-6 
quantum defect 62-4, 68 
quantum electrodynamics (QED) 40 
in bound systems 274 -7 
quantum error correction (QEC) 292 3 
quantum harmonic oscillator, ground 
state 235 
quantum jumps in ions 269-70 
quantum number, dependence of 
energy on 61 
quantum scattering 229-34 
quasi-electrostatic traps (QUEST) 125 
qubits 283-4, 291 
distinguishability 285 
entanglement 284-6 


Rabi frequency 124 
at saturation 143 
Rabi oscillations 128 
Rabi, Isador 119 
radial confinement in magnetic traps 
220-1 
radial solution, Schrédinger equation 
26-9 
radiation 
black body 13, 316 
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excitation probability function 125 
interaction of atoms with 123-4, 
146-7 
a.c. Stark effect 144 -5 
Kinstein B coefficients 126-7 
monochromatic radiation 127-32 
optical absorption cross-section 
138-41 
power broadening 143 
saturation intensity 142-3 
optical Bloch equations 137 
perturbation by oscillating electric 
field 124-5 
radiative damping 134-7, 146-7 
Ramsey fringes 132-4 
rotating-wave approximation 125 
semiclassical treatment 145-6 
quantisation 146 
radiation emission, Einstein A and B 
coefficients 11-13 
radiation force 179 
radiation pressure 179 
radiative damping 134, 146-7 
of classical dipole 135-6 
radiative decay 11 
raising operator 24 
Raman cooling 210-11 
Raman interferometry 255-7 
Raman scattering 164 
Raman transitions 310-13 
velocity selection 208-10 
Ramsey fringes 132-4 
in atomic fountains 212-13 
random recoil in laser cooling 188-9 
random telegraph signal 269 
rate equations 12, 146-7 
recoil limit 208 
wavelength of atoms and photons 
231 Me ce 
recoil velocity 180-1 Ses 
red frequency detuning 185, 267 ‘. ee 
reduced mass 232-3 ; 
reflection 33 
of atoms by evanescent wave 201-2 
resultant force 194-5 
refraction, resultant force 194-5 
refractive index 195-6 
relative frequency, laser calibration 
168-9 
relativistic effects 5-7, 36 
residual electrostatic interaction 80 
restoring force 14 
Ritz combination principle 2 
Rontgen 14 
rotating-wave approximation 125 
rotation 
measurement in atom interferometry 
290-3 
parity 33 
rubidium, chirp cooling 184 


runaway evaporation 225 

Russell-Saunders coupling, see 
LS-coupling scheme 

Rydberg atoms 11 

Rydberg constant 2, 5 


o-transitions 31 
s-electrons 
hyperfine structure 97-100 
radial functions 28 
s-series 34 
s-wave 231 
s-p transition, absorption of light 141 
saddle, rotating 262 
Sagnac interferometer 251 
saturated absorption spectroscopy 155 
cross-over resonances 159-60 
line shape 307-9 
of atomic hydrogen 159-63 
principle 156-9 
saturation intensity 142-3 
scattering amplitude 130 
scattering force 179-82 
theory of 199 
scattering length 231-3 
for sodium 236 
Schrédinger equation 22-3 
angular part 23-6 
for alkalis 68-71 
for helium 45 8 
inclusion of interaction between 
atoms 234 
numerical solution 69 
radial part 26-9 
time-dependent 123-4 
Schrodinger’s cat 249, 251 
screening 
alkali atoms 64-8, 74-5, 81-2 
helium atom 55 
hyperfine structure 102-4 
X-rays 9 
second, definition of 118 
second-order Doppler effect in 
two-photon spectroscopy 
167 
selection rules 30-2, 42, 90 
for F 116-18, 305 
for 7 42 
for J 90 
for | 32 
for L 90 
for Mp 116-18, 305 
for My; 288 
for M; 90-1 
for m; 30-2 
for My; 96 
for Ms 96 
for S 52 
parity 32-4 
summary 90, 305 


see also transitions 
self-consistent solutions 70-1 
semiclassical theory 145-6 
semiconductor diode lasers 168 
shape of Bose-Einstein condensates 
237, 239 
shell structure of electrons 7-10 
and periodic table 60-1 
shell, definition 61 
‘shooting’ method, see Schrodinger 
equation, numerical solution 
sideband cooling 277-9 
silicon, L.S-coupling scheme 81-2 
Sisyphus cooling technique 178, 203-7 
limit 207-8 
Slater determinant 71 
slowly-varying envelope approximation 
i) 
sodium 
chirp cooling 184 
collimation, effect on Doppler 
broadening 153-4 
de Broglie wavelength 246 
diffraction patterns 249-50 
dipole trapping 200-1 
fine structure 74 
fine-structure transitions 74-5 
interaction with polarized beam 
140-1 
probability density of electrons 61-2 
properties of a BEC condensate 236 
recoil limit 208 
slowing 181 
solid-state lasers, use in dipole 
trapping 200 
Sommerfeld 34 
theory of electron orbits 6 
sound, speed of 239 
specific mass shift 106 
spectroscopy 
history 1-2 
use of optical pumping 207 
see also Doppler-free laser 
spectroscopy; optical 
spectroscopy 
spherical harmonics 
expansion of 1/rig 55 
table 25 
spin eigenstates, helium 51-2 
spin of electrons 35-6 
in helium 46 
spin-orbit interaction 36-8, 101 
in alkalis 71-3 
jj-coupling scheme 84—6 
LS-coupling scheme 83-4 
spin-spin interactions 89 
standing waves in atom interferometry 
253-5 
Stark effect 144-5 
statistical mechanics 


Bose-Einstein condensation 316-18 
of photons 315-16 
Stern—Gerlach experiment 35, 114-15 
stopping distance 181-2 
sub-Doppler cooling 190, 203-4 
motion-induced orientation 207 
see also Sisyphus cooling technique 
sub-recoil cooling 
Raman cooling 210-11 
velocity-selective coherent 
population trapping 211 
see also evaporative cooling 
sub-shell, definition 61 
superconducting magnetic traps 219 
superfluidity 239-40 
healing length 240 
symmetric wavefunctions 48-52 
sympathetic cooling 266-7 
synchrotrons 10-11 


tellurium, use in laser calibration 
169-70 
temperature 
critical in BEC 237 
determination in ions 269 
meaning of 208 
terms, in L.S-coupling scheme 81 
Thomas precession factor 37, 101 
Thomas—Fermi regime 235-7 
Thompson, J. J. 14 
three-grating interferometer 251-2 
comparison with Raman 
interferometer 257 
time dilation 167 


time-dependent perturbation theory 
(TDPT) 29, 123 
tin, Doppler-free laser spectroscopy 114 
transformations, parity 33 
transit-time broadening 154 
transit time in two-photon 
spectroscopy 165 
transitions 
in alkaline earths 90 
in alkalis 74-5 
in helium 52-3 
in hydrogen atom 29-30 
parity 32-4 
selection rules 30-2 
trapped atoms, harmonic potential 235 
trapping 
effect on transition frequency 
measurement 211 12 
see also ion trapping; magnetic 
traps; magneto-optical trap 
triangle rule 42 
tunable lasers 168 
two-level atom 123-4 
two-photon spectroscopy 163-8 
two-photon transition 313-14 


units, inside back cover 
atomic 18-19 
uranium, ionization of 276 


vacuum fluctuations 40 

vector model 37, 83-4, 90 

vector operator 83 4, 90 

velocity selection by Raman transitions 
208-10 
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velocity-selective coherent population 
trapping 211 

vibrational energy levels 277-9 

volume shift between isotopes 106-8 


wave-particle duality 246 
wavefunctions 

angular 25-6 

radial 27-9 

symmetric and antisymmetric 48-52 
wavenumbers 2 
Wigner—Eckart theorem 83 


X-ray spectra 7-11 
X-rays, discovery of 14 


Young’s double-slit experiment 133-4, 
246-8 
ytterbium ion transitions 270 


Zeeman effect 13-16 
and hyperfine structure 108 9 
intermediate fields 111 
of hydrogen 109-10, 112-13 
strong fields 110-12 
weak fields 109-10, 112 
anomalous 35 
experimental observation 17-18 
in alkalis 75 
in LS-coupling scheme 90 2 
normal 13-16, 35, 91 
Zeeman slowing 182—4 
Zeeman sub-levels 93 


Fundamental constants 


Bohr radius for hydrogen 
Velocity of light in free space 
Electronic charge 
Gravitational constant 
Planck’s constant 


Boltzmann’s constant 
Electron rest mass 
Pronton rest mass 
Avogadro’s number 
Molar gas constant 
Rydberg’s constant 


Standard molar volume 

Fine structure constant 

Electric permittivity of free space 
Bohr magneton 

Nuclear magneton 

Magnetic flux quantum 

Stefan’s constant 

Magnetic permeability of free space 
Proton magnetic moment 

Neutron magnetic moment 
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c 2.9979 x 108 ms~! 
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h 6.626 x 1074 Js 
h/2n = h 1.0546 x 10-4 Js 

kp 13807 10-4 1K 

Me 9.109 x 1073! kg 
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Na 6.022 x 1073 mol! 

R 8.315J mol~! K~! 
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hcRx 13.606eV 
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Ey 8.854 x 107! Fm7! 

LLB 9.274 x 10774 A m* or JT~! 

LN 5.051 x 10-*? Am- or JT 
h/2e = Do 2.0678 x 10—!5 Tm? 

oO 5.671 x 10° War? kK 

Lo 4x x 1077 Hm7! 

Lp 2.7928 UN 
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Energy equivalents for photons E = hf = kgT = Se 


E ¥ i rt A 

(J) (eV) (Hz) (K) (m) (m7!) (cm!) 

1 6.242 x 10'8 1.509x 10°32 7.243 x 1072 1.486 x 10725 5.034 x 1024 = 5.034 x 102” 
1.602 x 107!9 1 2.418 x 10'* 1160x104 1.240x10-° 8.066x10° 8.066 x 10° 
6.626 x 10734 4.136 x 107!5 1 4.799x107!! 2.998x 108  3.336x 107? 3.336 x 107!! 
1.381 x 107-73. 8617x1075 2.084 x 10!° 1 1.439 x 10-2 69.50 0.6950 
1.987 x 10725. 1.240x 107 2.998108 1.439 x 107? 1 1.0 0.01 
1.987 x 10725 1.240x 1076 2.998x 108 1.439 x 10-7 1.0 1 0.01 
1.987 x 10723 1.240 x 1074 2.998 x 10!° 1.439 0.01 100 1 
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